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ABSTRACT 

At 2 > 1, the distinction between merging and “normal” star-forming galaxies based on single band 
morphology is often hampered by the presence of large clumps which result in a disturbed, merger-like 
appearance even in rotationally supported disks. In this paper we discuss how a classification based on 
canonical, non-parametric structural indices measured on resolved stellar mass maps, rather than on 
single-band images, reduces the misclassification of clumpy but not merging galaxies. We calibrate the 
mass-based selection of mergers using the MIRAGE hydrodynamical numerical simulations of isolated 
and merging galaxies which span a stellar mass range of and merger ratios between 

1:1-1:6.3. These simulations are processed to reproduce the typical depth and spatial resolution 
of observed Hubble Ultra Deep Field (HUDF) data. We test our approach on a sample of real 
z~2 galaxies with kinematic classification into disks or mergers and on ^100 galaxies in the HUDF 
field with photometric/spectroscopic redshift between 1.5 ^ z ^ 3 and M > 10^-^Mq. We find that a 
combination of the asymmetry Amass and M 2 o,mass indices measured on the stellar mass maps can 
efficiently identify real (major) mergers with < 20% contamination from clumpy disks in the merger 
sample. This mass-based classification cannot be reproduced in star-forming galaxies by i?—band 
measurements alone, which instead result in a contamination from clumpy galaxies that can be as high 
as 50%. Moreover, we find that the mass-based classification always results in a lower contamination 
from clumpy galaxies than an iJ—band classihcation, regardless of the depth of the imaging used (e.g., 
CANDELS versus HUDF). 

Subject headings: high-redshift-galaxies: interactions- galaxies: irregular-galaxies: structure 


1. INTRODUCTION 

Two decades of Hubble Space Telescope (HST) 
optical/near-infrared (NIR) observations - and more re¬ 
cently also ionized or molecular gas data - have un¬ 
veiled a complexity of morphologies in high redshift 
star-forming galaxies. Many z>l galaxies do not dis¬ 
play the disk or spheroidal morphology that is observed 
in the majority of local galaxies but are instead char¬ 
acterized by giant, star-forming clumps which domi¬ 
nate the light profiles and result in largely asymmet¬ 
ric appearances llCowie et ^ll99^lPaD0vich et al.ll2005l: 


Elmegreen et al.l '2007t iLaw et al.M200A Swinbank et all 


201ft Forster Sc hreiber et al.l 120111 : iGuo et al.l l2012t 
Tacconi et al.l I2013H . Although such irregular struc¬ 


ture has often beeii asso c iated with merge rs events 
(e.g., iGonselice et al.l [20081 iLotz et ^l2008al l. the use 
of NIR, integral-field (IFU) spectroscopy has enabled 
substantial progres s in the classif i cation of high red¬ 
shift galaxies (e.g. , 
lEpinat et al.l I2009L 


Erb et all 12004 IShapiro et ahl 120081 : 
201211 a:^ detailed kinematic anal¬ 


ysis have revealed ordered rotational moti on in a large 
number of these visually disturbed galaxies (iGenzel et ahl 
2006t iBournaud et al.ll2008t Ivan Starkenburg et al.ll2008t 


Forster Schreiber et ahl 1200911 . The formation of giant 
clumps in “normal” disk galaxies is thought to be the 
outcome of violent disk instability and fragmentation 
(jNoguchill 19991 IBournaud et al.l[2007t lAgertz et al.ll2009l : 


iDekel et'aril2009HGeverino et al.ll2010ll . developing as a 
consequence of t he high gas fractions that are typical for 


distant galaxies (IDaddi et al.ll20I0l 

Tacconi et all 120101: 

Magdis et all 120121: ISaintonge et all 

20131 ISargent et all 

2014{). 


In the lack of resolved kinematic data, the distinction 
between merging galaxies and clumpy disks remains how¬ 
ever an observational challenge. In this Paper we show 
how a quantitative classification performed on resolved 
stellar mass maps, rather than optical or NIR single-band 
images, can help disentangling the population of truly 
merging galaxies from that of clumpy disks even without 
available IFU observations. Our approach is motivated 
by a number of reasons. 

Even with the currently available second-generation 
instruments, obtaining reliable IFU kinematic measure¬ 
ments still requires major telesco pe time investments (se e 
e.g., the KMOS-3D campaign, iWisnioski et al.l [2015f) . 
Therefore, techniques which can provide robust proxies 
for the full kinematic informations are necessary. 

Ganonical merger classification methods, however, suf¬ 
fer from limitations which can hamper the distinc¬ 
tion between mergers and “normal-but-clumpy” galax¬ 
ies. The select ion of close pairs in the spatial and ve¬ 
locity domain (iBarton et ^ 120001: iGarlberg et ahl I2OOOI: 
Ellison et al.ll2008l:|de Ravel et al.ll2009HKampczvk et al l 

20131 : iPipino et ahl 12014 among the others), for exam- 
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pie, identifies by definition physically associated systems 
but it can be biased against very close galaxies (“fiber 
collision”) and thus late interaction stages, namely the 
merger phases in which the distinction between mergers 
and clumpy disks becomes more hazy. Close kinematic 
pairs would typically have a large enough separation to 
allow the individual morphological classification of each 
galaxy. 

Another common way of identifying mergers re¬ 
lies on the degree of irregularity in the light distri¬ 
bution either through non parametric measures such 


and Gini-M9n indices (lAbraham et al.l 

20031: IConselice 

2003; 

Lotz et al.l 12004 Law et al.l 12007 

: IScarlata et al. 

2007; 

Conselice et al.l 120081) or throug 

1 other indica- 
onents/tidal in- 
dge et al.l 120101: 

tors of the prese 
teractions (iKam 

nee of multiple comp 
Dczvk et al.l 120071: IBri 

Kartaltene et al.l 

2 OI 2 I: iLackner et al.l 2014). Using hv- 


have shown that combinations of non-parametric struc¬ 
tural estimators (G-A-M 20 ) are sensitive to the coales¬ 
cence phase and thus can be used also for evolved merg¬ 
ers. As mentioned above, however, at 2 > 1 the light pro¬ 
files are dominated by giant star-forming clumps even in 
regular disk galaxies and CAS-like classification schemes 
applied to single-band optical/NIR images typically fail 
in di stinguishing mergers from non -interacting galaxies 
(e.g., IHuertas- Company et al.l[2014ll . 

While the giant IO^Mq clumps contribute to 20%- 
50% of the flux in resolved optical/UV or star forma¬ 
tion rate (SFR) maps of > star-forming galaxies, 

they show a lower contrast with respect to the underly¬ 
ing disks on stellar mass maps, contributing to < 10% 
of the total mass budget (iForster Schreiber et al.ll2011l : 
IWuvts et al.ll201^ iCuo et al.ll2012ll . This suggests a po¬ 
tentially lower contamination of falsely identified mergers 
if measuring the aforementioned non-parametric struc¬ 
tural indicators directly on the stellar mass maps instead 
of the single band images: for clumpy galaxies we expect 
the stellar mass maps to display a regular, centrally con¬ 
centrated profile, whereas for merging galaxies multiple 
components will be present, with no clear central mass 
concentration. 

The exploitation of resolved mass (and also SFR or 
age) maps has become a common method of investi¬ 
gating the phys ical properties of low and high-redshift 


galaxies (see e.g. Welikala et al 


^ n[2nnl[z 

Wiiesinghe et al. 201(11 : iGuo et al. 120121: IWuvts et 
2 OI 2 I: iLang et al.l 12014 iTacchella et al.l 1201511 but have 


ibetti et ^120091 


not been used so far for a quantitative merger classifica¬ 
tion. The goal of this paper is to assess the performance 
of such a classification. We mainly focus on the test¬ 
ing and calibrating the proposed mass-based selection of 
mergers and defer a more in depth discussion on the prop¬ 
erties of mass-identified mergers to a forthcoming paper 
(A. Cibinel et al. in preparation). 

We use a set of merg ers and isolated gal axies from the 
MIRAGE simulations (jPerret et al.l 120141 to quantita¬ 
tively determine the efficiency and time scales probed 
when selecting mergers with mass-based structural pa¬ 
rameters. Although other studies have investigated the 
morphology of interacting galax ies in numerical sim¬ 
ulations (e.g., iLotz et all l2008bll . the use of the MI¬ 


RAGE sample enables us to make steps forward with 
respect to these previous analysis. The MIRAGE sim¬ 
ulations suite includes in fact key physical processes 
that are paramount for the formation and regulation of 
the giant star-forming clumps and thus naturally repro¬ 
duces the complexity of clump-dominated morphologies 
of high redshift galaxies. We then apply our classifica¬ 
tion scheme on a fiducial sample of 1.5 ^ 2 : ^3 galaxies 
in th e Hubble Ultra Deep Field (HUDF, iBeckwith et al.l 
120061) and compare the results of our new method with 
the iJ—band classification and also with previously pub¬ 
lished kinematic analyses. 

Specifically, the paper is organized as follows. We 
present in Section [2] the observational data utilized in 
our analysis, the sample basic properties and the gener¬ 
ation of the resolved mass maps for the HUDF galaxies. 
Section [3] presents the MIRAGE simulations and post¬ 
processing of the simulation output. We discuss our def¬ 
inition of a merging galaxy and some caveats regarding 
the simulations in Section |4l We provide a summary 
of the structural measurements performed on both real 
and simulated galaxies in Section [S] In Section |6] we 
calibrate the classification performed on the mass maps 
using the MIRAGE simulations and ancillary data with 
kinematic information. We then compare in Section [7] 
the mass-based classification and the standard iJ—band 
classification for the real HUDF galaxies. Finally, Sec¬ 
tion [8] summarizes our findings and conclusions. Consid¬ 
erations about signal-to-noise (S/N) effects and possible 
systematic biases are presented in Section 12.51 and Ap¬ 
pendix 

All magnitudes are in the AB system an d corrected for 
galact ic absorption using the dust maps of iSchlegel et al.l 
(|I998[) when necessary. Throughout the paper we use 
interchangeably the notation “iJ-band” to refer to the 
HST/Wide Field Camera 3 (WFC3) FI60W filter. If 
needed, we also use the abbreviations 6, z and Y when re¬ 
ferring to the HST /Advanced Camera for Surveys (ACS) 
F435W, HST/ACS F850LP and HST/WFC3 FI05W 
filters, respectively. Quoted masses assume a Chabrier 
initial mass function (IMF). We finally note that we 
will sometimes use the notation “clumpy disks” to re¬ 
fer those galaxies that have a clumpy appearance in the 
iJ—band/optical images but are not classified as merg¬ 
ers with our method. We stress however that this is not 
meant to be a quantification of the intrinsic strength of 
the bulge component in these galaxies. 

2. OBSERVATIONAL DATA 

We apply the new classification scheme on a sample of 
galaxies in the HUDF field which, thanks to the avail¬ 
ability of extremely deep observations but also medium 
and shallow coverage over the same area, enables us to 
generate high accuracy mass maps as well as to assess 
the impact of S/N on the classification. We briefly sum¬ 
marize here the sample selection criteria and the relevant 
information for the public data sets utilized in this study, 
referring the reader to the original works for further de¬ 
tails. 


2.1. Archival HST Observations 

To build the pixel-by-pixel mass maps and perform the 
analysis described in Section [7l we exploited the data 
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Figure 1. Location of the galaxy sample on the mass versus SFR 
plane. The dashed line shows the locus of the z = 2 main se- 

§ uence of star-form ing galaxies (based on literature compilation in 
argent et al.| 201^ and the small gray points are all galaxies in 
the lGu^^^^ <120131^ GOODS-S photometric catalog with a pho¬ 
tometric redshift 1.5 ^3. The large green symbols correspond 

to the initial sample of 132 galaxies studied in the present work, 
prior to applying the size and magnitude selection of Section 12.51 
The final sample of 87 galaxies on which reliable mass maps could 
be derived after applying this selection is shown with the red sym¬ 
bols. Triangles indicate IR-based SFR (either from a combination 
of NUV-|-monochromatic 24/i.m flux density or from a fit to the 
mid-to-far IR SED). Circles are instead galaxies without reliable 
IR photometry and for which the SFR is thus estimated from the 
dust-corrected UV luminosity. Galaxies with very low SFR are 
identified as quiescent based on their bzH or zY JH colors and the 
upper limits on their IR fluxes. For these galaxies no dust extinc¬ 
tion is applied when computing the UV-based SFR and are hence 
imposed to lie below the main sequence. The sample considered 
here is mostly composed of normal (main sequence), star-forming 
galaxies. 

from several public campaigns covering the HUDF area 
with a multi-tiered approach. 

For galaxies in the original HUDF ACS field (3'x3') 
we generated two versions of the mass maps using either 
(1) the deep HST/kCS F4 35W, F606W. F775W and 
F850LP images from HUDF (|Beckwith et al.ir2006f) com¬ 
bined with the i75r/WFC3 F105W, F125 W and F160W 
images from the CANDELS -Deep survey (|Grogin et al.l 
[20TltlKoekemoer et al.ll2011ll . or (2) the F435W, F606W, 
F775 W and F850LP observa tions from the GOODS sur¬ 
vey (IGiavalisco et al.l 1200411 plus the CANDELS-Deep 
NIR photometry. 

For the sub-set of galaxies in the central 2'x2' region 
of HUDF (corresponding to the WFC3 field of view), 
we also generated a third mass map making use of the 
extremely deep observations available at all wavelengths: 
the optical HUDF images and the i7S'r/WFC3 F105W, 
F125W, F140W and F1 60W coverage from the HUDF09 
and HUDF12 surveys (jBouwens et al.l [20111 : lEllis et al.l 
1201311 . We specifically employed the mosaics provided by 
the HUDE12 team which are combined with the HUDE09 
datasets. In all cases, CANDELS observations are also 
used for the E814W filter. 


2.2. Parent Catalog, Photo-z and Stellar Masses 

The identification of the galaxy sample and the calcu¬ 
lation of photometric redshifts (photo-z) are based on the 
iJ-band selected, multi-wavelength catalog p ublished by 
the CANDELS team in the GOODS-S field (iGuo et al.l 
[Ml and references therein). The photometric data 
available in this compilation consists of imaging in 17 
medium and broad-band filters ranging from the t/-band 
to the IRAC 8fim channel. 

We derived photo-z and in tegrated g a laxy stellar 
masses for all galaxies in the iGuo et ^ (1201311 sam - 
ple as described in full details in iPannella et al.ll2014ll 
Briefly , photo-z were estimated from the IGuo et al.l 
( 2013|) photometric ca talog using the public code EAZY 


( Brammer et al.l 120081) an d a combination of th e stan¬ 
dard set of templates from lWhitaker et all (120111) . These 
photo-z reach an accuracy of Az = |zphot — -2zpec|/(l + 
Zspec) = 0.03. Published spectroscopic redshifts (spec-z) 
are available for about 30% of our final HUDE sample^ 
Eor galaxies with a secure spectroscopic measurement we 
considered the spec-z as the final redshift, whereas the 
photo-z was preferred in those cases in which only a ten¬ 
tative (or no spec-z) is given. We then obtained galaxies 
stellar masses through fitting of the spectral en ergy dis¬ 
tribution (SED) with FAST (iKriek et al.ll2009D, keeping 
the re dshift fixed and using a set of iBruzual &: Charl^ 
(|2003D stellar population models with delayed exponen¬ 
tially declining star formation histories. Dust absorption 
with a maximum of Ay = 4 was allowed in the fitting 
(|Calzetti et al.ll2000D . 

2.3. Sample Selection 

The init i al ga laxy sample was extracted from the 
IGuo et ahi (I2013D catalog by selecting galaxies over the 
HUDF area having a spectroscopic or photometric red¬ 
shift between 1.5^z^3 and an 77-band magnitude 
brighter than 77 ^ 26 mag. The redshift selection allows 
us to probe the rest-frame FUV to optical for all galax¬ 
ies; the luminosity cut is instead applied to ensure that 
a minimum signal-to-noise is reached in most pass-bands 
and thus a reliable photo-z estimate can be obtained. 
The exact value of 77 ^ 26 mag was chosen empirically 
by requiring that 80% of all GOODS-S galaxies, within 
the same redshift bin as the one here considered, have a 
S/N>3 in at least 10 of the pass-bands which are used in 
the photo-z calculation. This magnitude threshold also 
ensures that the sample lies com fortably ab o ve th e 50% 
completeness limit of the parent IGuo et Ml (|2013D pho¬ 
tometric catalog (77 = 26.6). 

The 77—band magnitude limit translates into a red¬ 
shift dependent mass completeness threshold. For a clear 
selection of the sample, we hence apply a further cut 
in stellar mass to include only galaxies above the com¬ 
pleteness value. Given that we are mostly interested 
in studying the properties of clump-dominated, star¬ 
forming galaxies, we consider in the following the mass 
completeness limit for star-forming galaxies. To derive 
this threshold we followed the procedure described in 

® In particular, we referred to the ESO compilation of 
GOODS/CDF-S spectroscopy, http://www.eso.org/sci/activities/ 
garching/projects/goods/MasterSpectroscopy.html which collects 
spec-z from several spectroscopic surveys which have covered (also) 
the HUDF field. 





























































4 


Cibinel et al. 



Figure 2. Illustration of the mass map obtained in Section 12.41 for a case exa mple of a galaxy with a merger-like appearance in the 
optical/NIR images and a disk-like morphology in the mass map (ID 12451 in the lGuo et al.ll2013l ~) catalog, photometric redshift z=2.75). 
From left to right we present the bzH (F435W, F850LP and F160W) composite image, the /f—band stamp and the mass map. In spite of 
displaying several, equally bright clumps in the Ff—band or bzH images, the stellar mass map reveals a single galaxy, with a mass profile 
centrally concentrated at the position of the red (bulge) component in the bzH image. 


e.g.. lPozzetti et all ()201(T[) . For each galaxy we estimated 
the mass Miim that it would have, keeping its mass-to- 
light ratio constant, if faded to the limiting magnitude 
H =26. We then calculated, at each redshift, the mass 
below which lie 90% of Mum in the 30% faintest galaxies 
- considered to be representative of the typical M/L of a 
galaxy close to the magnitude limit. The final complete¬ 
ness limit is set by the highest redshift here considered 
(z=3), corresponding to a value of M > IO^'^Mq for 
star-forming galaxies. The equivalent number for quies¬ 
cent galaxies would be M > IO^^'^Mq. 

After also rejecting galaxies which fall too close to the 
HUDF edges for reliable measurements, our initial sam¬ 
ple includes 132 galaxies with 1.5^z^3, H ^ 26 and 
We derived structural para meter s for all 
these galaxies and we provide them in Table ixn but the 
sample is further restricted for our final analysis as a re¬ 
sult of the reliability assessment of the stellar mass maps 
that we present in Section [275] and which is based on tests 
performed on this initial, larger sample. 

While deferring to a forthcoming paper the detailed 
analysis of the star formation properties of mass-selected 
mergers, we show in Figure [1] the position of this initial 
sample of 132 galaxies on the mass versus SFR plane for 
illustration purposes. Even above the mass completeness 
limit for quenched galaxies, the majority of the galaxies 
here considered lie on the locus of the so-called main se¬ 
quence of star formation (e.g., IBrinchmann et HI 12004 
iDaddi et al.ll2007l : lElbaz et'5]l2007D . The few galaxies 
with very low SFR (SFR < 1 Mq yr“^) have been identi¬ 
fied as quiescent fro m a combination of their hzH colors 
or YH Vz colors ('see lDaddi et al.ll2004bl : l^meron et ah! 
I2011D and the constraints coming from their IR flux up¬ 
per limits. These galaxies are by definition forced to lie 
below the main sequence by imposing no dust extinction 
in the calculation of the UV-based SFR. Our sample is 
thus representative of the typical population oi z 2 
star-forming galaxies. 

A summary of the properties for the full sample of 
132 galaxies and the classification into mergers and non¬ 
interacting galaxies from the stellar mass map analysis 
is given in Table [ATI 

2.4. Pixel-hy-pixel SED Fitting and Stellar Mass Maps 


As a first step for the generation of the resolved mass 
maps, we registered all the ACS and WFC3 tiles to the 
same resolution and pixel scale of the iJ-band which has 
the worst point-spread function (PSF) among the other 
available filters (^0."15). To do so, we created an indi¬ 
vidual PSF for each filter by stacking several unsaturated 
stars in the HUDF field and computed the convolution 
kernels to match the PSF of the iJ-bantfl. From the 
matched images, we then extracted postage stamps for 
each galaxy in our sample with a size equal to 3 times the 
iJ-band Kron radius and, as further justified in Section 
14.11 we cleaned from the stamps any nearby companion 
galaxy with a known spec- or photo-z. 

To derive pixel-based stellar masses and the actual 
mass maps, we extracted pixel-by-pixel SEDs within an 
elliptical aperture equal to the galaxy iJ—band Kron 
semi-major axis and fitted them with stellar popula¬ 
tion models. Given the relatively low flux in each in¬ 
dividual pixel, some degree of smoothing or binning is 
required to ensure a minimum S/N on most of the fil¬ 
ters and thus obtain reliable parameters from the pixel 
SED fitting. Several approaches have been used in 
the li terature to deal with S/N homogenization prob¬ 
lems (ISanders fc Fabianll20011 ICappellari fc CoDinll2003l: 

Ebeling et al.l 120061 : iWuvts et al.l 120121 : iCibinel et al l 
2013b[) . We opted here for the publicly available 
code ApAPT S MOOT ifl developed in iZibettil l)2OO90 and 
IZibetti et al.l (120091) . Whenever the S/N falls below 
a given threshold, this algorithm performs an adap¬ 
tive smoothing of the images by replacing the original 
pixel values with an average of the galaxy flux over 
larger and larger circular areas as the S/N decreases. 
Adaptsmooth features two useful options: (a) the 
smoothing of several images on the same scale lengths 
- necessary to derive self-consistent SEDs - can be easily 
performed and (b) the pixel identity is maintained, as 
opposed to binning schemes in which neighboring pixels 
are assigned a common value, effectively grouping them 

^ The HST PSF varies slightly across the field of view and this 
effect could be taken into account by selecting for each object a 
nearby star instead of using a common PSF for the entire field. 

However, we estimate that the error introduced by using a single 
PSF is comparable with the uncertainty associated to noise effects 
when using individual stars. 

® http: //www.arcetri.astro.it/~zibetti/Software/ADAPTSMOOTH.html 
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Figure 3. Results of the tests performed on the toy-model mass 
profiles to assess the ability of reconstructing the stellar mass distri¬ 
bution. For all models falling in any region of the the size (Kron ra¬ 
dius normalized to PSF) versus ft—band magnitude plane marked 
by the horizontal and vertical dashed lines, we calculated the me¬ 
dian absolute residual (A) and the dispersion (cr) between input 
and reconstructed mass maps. The region boundaries are selected 
such to have at least 5 models in each bin. Areas where A (solid, 
45 degrees CW shading) or a (dashed, 45 degrees CCW shading) 
are low are colored in green, whereas a red color indicates system¬ 
atic shifts and large scatter around the input models, i.e., a high 
uncertainty in the derivation of the mass maps. The colorbars at 
the top of the figure provide the mapping between colors and the 
absolute values of A and tr, in dex. The figure refers to models 
created to reproduce the typical depth of the optical HUDF and 
CANDELS-Deep NIR data, which is in between the three combi¬ 
nations of photometry here explored (see Section l2.1ll . 

together into a final “macro pixel”. 

The choice of the reference band(s) defining the 
smoothing kernels applied by Adaptsmooth is a trade¬ 
off between attaining the best S/N and maintaining spa¬ 
tial resolution: using the band with the lowest S/N 
will obviously degrades the resolution, while referring to 
the one with the highest S/N will likely result in noisy 
SEDs. Furthermore, also in the light of forthcoming 
analyses on the comparison between the resolved mass 
and SFR distribution, we are interested in detecting fea¬ 
tures such as giant star-forming clumps which may be 
intrinsically bright in one band but have a smaller flux 
contrast at longer wavelengths. A too broad smooth¬ 
ing on the red band may completely erase these struc¬ 
tures in the blue filters. After testing single or multi¬ 
ple bands smoothing, we found the optimal configura¬ 
tion by running Adaptsmooth on stacked images of all 
ACS and WFC3 stamps: this ensures that the smooth¬ 
ing is applied on those pixels where the majority of the 
bands reach a low S/N while preserving the structural 
variations in the different filters. Specifically, we run 
Adaptsmooth by requiring a minimum S/N = 5 on the 


stacked images and halting the adaptive smoothing when 
the averaging area reaches the maximum radius of 5 pix¬ 
els. The smoothing pattern thus obtained was then ap¬ 
plied to all available bands giving a median S/N ^5 also 
on the individual pixels for most filters. We note that 
we have tested that our results are not substantially af¬ 
fected by a different choice of the smoothing kernel (e.g., 
by applying the smoothing on the i?—band only). 

We fitte d the adaptively sniqothed pixel SED s with 
Le Phare (lArnou ts et al.l H999l : lllbert et'HI 120061) using 
the iBruzual fc C hariot ( 2003D synthetic spectral library 
with a IChabrien (|2003f) IMF and a delayed exponen¬ 
tial star formation history, ip oc (t/r^) exp(—t/r). The 
characteristic time scale r was let vary between 0.01 
and 10 Gyr in 22 steps and template ages were chosen 
between 100 Myr and the age of the Universe at the 
given redshift. We allowed three metallicity values in 
the fitting {Z = 0.2 Zq, Z = 0.4 Zq and Z = Zq) and fur¬ 
thermore applied internal dust extinction by assuming 
a Calzetti law and E{B — V) ranging between 0 and 
0.9 mag. We defined our fiducial pixel mass estimate as 
the median mass from the full probability distribution 
function from all templates, but our results would remain 
unchanged if we had used the mass from the best-fit tem¬ 
plate (i.e., minimum x^) instead. As a validation of the 
derived mass maps, we verified the consistency between 
the sum of the pixel-based masses and the integrated 
galaxy mass in Appendix lA.ll We find an agreement at 
the level of <0.1 dex between the two estimates. 

An example of the derived mass maps is given in 
Figure [21 where we compare the bzH composite im¬ 
age, the i?—band image and the mass map for a galaxy 
in our sample (ID 12451). We have intentionally cho¬ 
sen a galaxy which displays a different structure in the 
iif—band than in the mass map to clearly illustrate how 
at z > 1 i/—band light and mass are not equivalent trac- 
ers of morphology . This was already pointed out by 
IWuvts et ai.l(|20i^ ( see for example their Figure 2 which 
has two galaxies overlapping with our sample in Figure 
fTTl) and we will further demonstrate it in Section [71 

2.5. Systematic Uncertainties in the Reconstructed 
Mass Maps 

Numerous studied have shown that the ability of 
recovering intrinsic galaxy properties from observed 
flux distributions depends strongly on the image S/N, 
resolution and even o n the intrinsic properties of the 
galaxies themselves (iDisne ^ 119761: ISchwei^ 119791: 
Franx et al.l 119891: llnipev fc Bothunlll997t Truiillo et al l 


2001t Graham et al. I ^051: Cameron fc Driveil 

Bailin fc HarrisI ~ 2008t [Graham fc WorlevI 


2007 


2008 


Mailer et al.l 120091 : ICarollo et al.l I2013at ICibinel et al l 

2013aD . Our mass-based measurements will also be 


affected by similar limitations. For structural measure¬ 
ments performed on optical images, tests on artificial 
galaxies have demonstrated that it is possible to derive 
correction schemes that can largely account for the 
systematic biases in the estimates of galaxy structure 
- e. g., galaxy radii and c oncentrations - for both local 
(see ICibinel et al.l I2013al for an a pplication to z 0 
galaxies from the ZENS sample inICarollo et al.ll 2013bli 
and high redshift galaxies (see ICarollo et al. 201^ for 
an a pplication to the COSMOS survey. iScoville et al.l 
( 2001 . 
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We do not attempt here a derivation of similar cor¬ 
rections in the mass domain as this would require large 
suites of artificial mass maps and hence significant as¬ 
sumptions on the mass (and dust) distribution in merg¬ 
ing and non-merging galaxies. We can nonetheless per¬ 
form some tests which enable us to define the “bound¬ 
aries of applicability” of our method, i.e. the regimes 
where we can obtain reliable measurements. 

It is clear that for unresolved objects no meaningful 
mass reconstruction can be performed nor the individ¬ 
ual galaxies participating in the merger can be identi¬ 
fied. Likewise, the reconstruction of the mass distribu¬ 
tion becomes more and more difficult as the flux in the 
pixels reaches the surface brightness limit of the observa¬ 
tions. To derive a global magnitude and size limit below 
which we cannot reliably derive mass maps, we tested our 
SED-fitting and mass reconstruction technique on a set 
of artificial galaxies with known mass distribution drawn 
from our initial sample of 1.5 ^ z ^ 3. We describe these 
models in detail in Appendix IA.21 For each of the toy 
galaxies, we compared the mass profile reconstructed fol¬ 
lowing the procedure in Section [ 2 ^ with the input model 
and calculated the median of the residuals as well as the 
typical dispersion around the input model. 

The results of the comparison are presented in Figure[3] 
where we color code each region of the size versus magni¬ 
tude plane according to the mean absolute residual value 
and the dispersion of all models falling in that specific 
area: a red/orange color indicates high residuals/scatter 
in the reconstructed maps, green corresponds to low 
residuals/scatter. The Figure illustrates how the abil¬ 
ity of measuring reliable mass maps degrades as galaxies 
approach the resolution limit or reach low surface bright¬ 
nesses: at sizes rKron 5xPSF (roughly 15 pixels) and 
magnitudes H > 24.5 systematic shifts and/or large de¬ 
viations from the input model affect the measured mass 
maps 0 

We thus used the thresholds H ^ 24.5 and 
?’Kron>5xPSF to select galaxies with reliable mass 
maps, reducing the sample of 1.5 ^ z ^ 3 galaxies to 89 
objects. Two of these galaxies, ID 13508 and ID 11800, 
are strongly contaminated by a bright neighbor and 
have for this reason less robust mass maps. These cases 
are flagged in Table lAll and excluded from the sample 
used for the analysis in Section [T] We show this final 
sample of 87 galaxies with red points in Figure [T] Note 
that the H ^ 24.5 selection is incidentally the same cut 
that has be applied for a reliable vis ual morphological 
classi fications on the CANDELS fields (|Kartaltepe et all 

[Ml . 

3. THE MIRAGE SIMULATED GALAXIES 
3.1. Description of the Simulations 

The details on the techni cal aspects o f the M IRAGE 
simulations are presented in iPerret et al.l (|2014fl and fur¬ 
ther discussions on the physics implemented i n these sim- 
ulations can als o be fou nd in iBenand et al.l (j2013ll and 
iBournaud et al.l ()2014af) . Briefly, three closed-box disk 
models with a bulge-to-total fraction of 8 % in mass and 

^ The inferred magnitude limit refers to mass maps generated 
from artificial images matche d to the HUDF + CANDELS-Deep 
data, as described in ADDendix lA.2l For observations at a different 
depth this limit will scale accordingly. 


stellar masses of 10® ®Mq (simulation G3 in iPerret et al.l 
120141 1. 10 ^°-^Mq (G2) and (G1) were generated 

using an adapti ve mesh refin ement technique with the 
RAMSES code (|Tevssieill2002f) . These simulations reach 
a resolution of 7.3pc at the highest level of refinement 
and have initial stellar mass particles of 1 . 2 xlO^M 0 and 
1 . 7 xlO^M 0 in the bulge and disk components, respec¬ 
tively. The disk galaxies were evolved in isolation or 
merged with each other, effectively probing merger ra¬ 
tios of 1:1, 1:2.5 and 1:6.3. In order to construct a repre¬ 
sentative sample of galaxy mergers, four different orbital 
parameters were explored for each merger ratio combina¬ 
tion, resulting in a total of 20 mergers simulations (plus 
the 3 isolated disk models). 

For each simulation configuration we utilize in the fol¬ 
lowing a set of 16 snapshots separated by 40Myr each, 
covering an epoch from 200 Myr to 800 Myr from the sim¬ 
ulations initial conditions; for the merger models this cor¬ 
responds to follow the pre- and post-coalescence phases 
for roughly 30 0 Myr each (the coa lescence time is visually 
determined in iPerret et al.ll2014[l . 

The MIRAGE simulations feature several aspects 
which are paramount for a correct description of the ISM 
physics and thus for a robust comparison with the real 
data. First, the high level of grid refinement enables us 
to resolve Jeans-unstable regions within the disks, con¬ 
sequently we can not only model the disk fragmenta¬ 
tion and the formation of large clumps but also properly 
describe outflows and heating within the giant clumps 
themselves. 

Second, a physically motivated feedback model is im¬ 
plemented in th e simulations by coupling standard super¬ 
novae feedback (iDubois fc Tevssi^l2008fl with the novel 
recipe for photoionization and ra diation pressu r e feed - 
back from OB-stars developed in iRenaud et al.1 (|2013ll . 
This feedback model reproduces the typical outflows, 
star formation rates and the stellar population ages 
(5,200 Myr) observed in real clumps (jBonrnaud et al.l 
1201 4all . 

Third, the simulated disks have an initial total gas 
fraction of fg = 65% which is well representative of the 
observed high molecul ar fractions in typi c al z > 1 , star- 
formi ng galaxies (e.g., iDaddi et al.ll201(Tl : iTacconi et al.l 
l 2010 fl and is essential for the onset of gravitational insta¬ 
bility and the generation of the giant clumps. 

Finally, the MIRAGE simulations were originally de¬ 
signed as a “n umerical counterpa rt” for the MASSIV 
galaxy sample (iGontini et al.ll201^ . and for this reason 
they are tailored to high-z galaxies in terms of global 
pr operties (e.g., thei r sizes and SFR, see Figures 6 and 9 
of iPerret et al.l(^014l l. 

3.2. Post-processing of the MIRAGE Output 

To reproduce the data available for the HUDF galaxies 
also for the simulated MIRAGE sample, we extracted 
from each simulation snapshot a stellar mass density map 
and three stellar flux maps, in the HST/ACS F435W, 
F850LP and i7S'T/WFG3 F160W filters. We used both 
face-on and edge-on line of sights computed from the 
angular momentum of the most massive disk, at an initial 
resolution of 50 pc. 

The mass maps were simply obtained by projecting the 
distribution of stellar mass particles in the simulations. 
We instead derived the mock observations by assigning a 
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Figure 4. Left: observed //—band magnitude versus mass relation for real galaxies and the simulated MIRAGE isolated disks. With 
magenta triangles we show the evolution in mass and luminosity of the isolated disks in the MIRAGE simulation over the 600 Myr here 
considered. Blue circles are the MIRAGE simulated mergers. The black points correspond to real CANDELS/GOODS-S galaxies with 
1.5^2: ^3. Right: bzH (F435W-F850LP and F850LP-F160W) colors for the simulated disks and observed galaxies. The horizontal and 
slanted lines di vide the bzH plane in the locus of z 2 star-forming (SF) and quiescent galaxies (Q) following a similar approach as the 
BzK selection flPaddi et al.|[^04blV Symbols are the same as in the left panel. The MIRAGE simulations are representative of typical 
2 : 2, star-forming galaxies. 


Starburst99 (jLeitherer et al.l 119991) instantaneous burst 
model with a Salpeter IMF and an effective metallic- 
ity Z = 0.004 to each star particle on the basis of its 
formation age. This metallicity corresponds to the in¬ 
tegrated value that is expected for a disk galaxy with 
a central metallic ity in agre e ment with observations of 
z ~ 2 galaxies by lErb et aH (j2006D . We note here that 
no dust extinction was applied to the simulated fluxes. 

To produce flux and mass maps that can be directly 
compared with observations we: 

1. Generated observed-frame F435W, F850LP and 
F160W images by redshifting the Starburst99 
SEDs and applying cosmological dimming. We as¬ 
sumed a reference redshift z = 2. 

2. Matched the artificial F435W & F850LP images 
to the HST/ACS resolution (^0"08) and the 
i7—band and mass maps to the HST /WFC3 reso¬ 
lution (^0."15). To do so, we convolved the artifi¬ 
cial maps with the PSFs kernels constructed from 
stars in the HUDF held (see Section [2^ . 

3. We also pixelized the simulation images to the ACS 
and WFC3 pixel scales (0."03 and 0."06, respec¬ 
tively), again assuming an average z=2. 

4. Finally, we added noise to the flux and mass maps 
mimicking the typical imaging depth and the un¬ 
certainty in the reconstruction of the mass distri¬ 
bution for the real images. For the mock HST 
images, this was done by adding poissonian noise 
(on the maps converted in electron units) and by 
inserting the simulations into empty sky regions 
extracted from the real observations. We used sky 


regions from the HUDF12 area, when considering 
the iJ—band mock images, and instead sky areas 
from the HUDF tiles for the F435W and F850LP 
hlters. To account for uncertainties on the mass 
maps, we added a mass-dependent error calculated 
from the observed mass versus error relation for in¬ 
dividual pixels in the sample of real galaxies. We 
matched the simulations to the deepest available 
photometry only, i.e., HUDF -|- HUDF12, as we test 
the effects of different imaging depths on the data 
itself in Section [T] 

We then ran SExtractor on the noisy, stellar mock 
images with similar configuration parameters as those 
employed for the construction of the GOODS ACS multi¬ 
band catalogs in the case of the artificial F435W and 
F850LP imagejq and using instead the “cold-mode” pa¬ 
rameter settings of the lGuo et al.l 1)20131) CANDELS mul¬ 
tiwavelength catalog in the case of the mock iJ—band 
images. The SExtractor outputs were used for de¬ 
tecting/deblending the merging galaxies and to define 
their photometric properties, e.g., sizes or fluxes. 

The integrated luminosities and colors of the MIRAGE 
galaxies resulting from such post-processing are pre¬ 
sented in Figure SI where we compare the simulations 
with real 1.5 ^ z ^ 3 galaxies in the GANDELS/GOODS- 
S field. There is a good agreement between the MIRAGE 
disks or mergers and the real sample of galaxies. At high 
masses, the MIRAGE models tend to be slightly brighter 
than observed galaxies, most likely as a consequence of 
applying no dust extinction to the simulations. Although 
limited by the intrinsic colors of the Starburst99 tem- 

http://archive.stsci.edu/pub/hlsp/goods/catalog_r2/ 
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Figure 5. Upper panels - magenta frame: observed-frame mock flux maps and stellar mass density map for one of the isolated disk in 
the MIRAGE simulations (face-on projection for the isolated disk ‘G2’) at a time t=276Myr from initial conditions. From left to right we 
show the flux maps in the F435W, F850LP, F160W Alters and the stellar mass map, respectively. All images are redshifted and rebinned 
to the HST pixel scale for an average redshift z = 2 and convolved to the typical resolution of each band. In the top row are the original 
simulation snapshots and the second row presents the same maps once degraded to the typical S/N of the HUDF/HUDF12 observations. 
For the flux maps the color coding shows positive deviations with respect to the mean sky r.m.s. (with the color map ranging from 0 
to 20cr). Middle panels - blue frame: as above but for a simulated merger (minor merger involving the G1 and G3 disks with orbital 
parameters =90 , 62 = 90 and k = 90 , see lPerret et al.]|2014l) . Lower panels: the F435W, F850LP, F160W images and the derived 
stellar mass map for a real 2 ~ 1.8 galaxy in the HUDF held are given for comparison with the simulations. The MIRAGE simulations 
well reproduce the wavelength-dependent, complex morphology of real z~2 galaxies. 


plates used to construct the flux maps, the simulated 
galaxies have also (5 — z) and {z — H) colors that are 
typical of 2 ; 2 star-forming galaxies. 

Examples of the mass and flux maps before and after 
noise degrading are presented in Figure 0 for an isolated 
disk and a merger simulation. In the same Figure, we 
also show for comparison a real z ~ 2 galaxy extracted 
from the HUDF field. The simulated galaxies well re¬ 
produce the morphological/structural properties of real 
observed galaxies and their variation with wavelength of 
observation. 

4. CAVEATS AND GENERAL COMMENTS 

Before proceeding with the analysis, we clarify here 
some definitions and address caveats which are relevant 
for the following discussion. 


4.1. Definition of Merger in Our study 

As specified in the Introduction, our intent is to op¬ 
timize the classification for galaxies that are close to 
the coalescence phase, with less interest to early merger 
stages which can be identified with other techniques (e.g., 
kinematic pair selection). For this reason, HUDF galax¬ 
ies in pairs with a sufficient separation to allow distinct 
identification will be considered as individual objects. 

Practically, this means that we perform the struc¬ 
tural measurements and classification separately for 
any galaxy appearin g as a single entry in the par¬ 
ent I Guo et ahf (|2013ll catalog, even if the galaxy is in 
a close pair with another in the catalog. Moreover, 
this also implies that any galaxy displaying multiple 
clumps/components which do not appear as individual 
sources in the H—band catalog will be considered as 
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a “multi-component” single system, being it either a 
clumpy galaxy or a merger (but see next section). 

The ability of separating two galaxies in real observa¬ 
tions depends on several aspects, such as the intrinsic 
surface brightness and brightness contrast of the merg¬ 
ing galaxies, the signal-to-noise of the images, the size 
of the galaxies, etc. Hence it is not straightforward to 
convert the SExtractor deblending threshold into a 
unique minimum distance between the two galaxies. On 
average, however, the closest s eparation at whic h the 
HUDF 1.5 < z ^ 3 galaxies in the lGuo et al.l (|2013[ 1 cata¬ 
log are still deblended is of ^ 10 kpc; our merger sample 
will hence include galaxies at smaller separations. 

Following the same philosophy, we extracted from 
the simulations a sample of close-to-coalescence merg¬ 
ers which we refer to as “pre-coalescence” merg¬ 
ers/snapshots. In this sample are excluded both pre¬ 
merger (i.e. well separated) galaxietE3 and post-merger 
remnants (i.e., snapshots extracted at a time after the 
coalescence of the two nuclei). In fact, galaxies in such 
phases have either to merge yet or have already com¬ 
pleted the merger phase and have settle in their final, 
unperturbed state with no clear structural signatures of 
the past merger event. In the simulation, and possibly in 
the observations too, such galaxies are indistinguishable 
from the isolated galaxies. 

Among these pre-coalescence mergers, some reach an 
almost unperturbed appearance already a '^20 Myr be¬ 
fore the coalescence time. Although these simulated 
galaxies are still nominally undergoing a merger, they 
also occupy a locus of structural parameters which is 
overlapping with that of “normal” galaxies. To obtain an 
as pure as possible merger training sample, we excluded 
these simulations from those employed to calibrate the 
classification in Section 151^ We refer to this clean set of 
mergers as the “training-sample”. 

With such a selection the resulting input training/pre¬ 
coalescence sample of simulated galaxies is roughly 
equally divided among mergers and isolated disks 
(82/110 and 96 galaxies respectively) and 70% of these 
mergers have a ratio <2.5, i.e. are major mergers. 

4.2. Keeping Projection Effects under Control 

Obviously, for real data the presence of multiple com¬ 
ponents which are not deblended into individual sources 
does not necessarily imply that we are looking at a sin¬ 
gle clumpy galaxy or that the components are physically 
participating to a merger. Especially for those galaxies 
displaying extreme color gradients, one may worry about 
chance projections. As mentioned in Section [2j3] spectro¬ 
scopic redshifts are available for only a minority of the 
sample, hence we cannot rely on the spectral information 
to confirm associations. 

Another way to test this possibility is to compare the 
source identification/photo-z based on the i7—band ex¬ 
traction with those obtained at shorter wavelengths to 
see whether the individual clumps could in principle be 

In order to follow for the simulation an approach as close 
as possible to that applied on the real galaxy sample, we relied 
on the SExtractor output to determine when the two simulated 
galaxies can be clearly deblended (see Section 13.21 for details of 
the SExtractor run on the simulations). Consistently with the 
observations, this translates into a maximum separation of about 
10 kpc also for the simulated galaxies. 


identified as single objects located at a different red- 
shift and had simply passed undetected in the i7—band. 
Of course, the advantage of an iJ—band extraction is 
precisely that - by probing the rest-frame optical for 
z ^ 2 galaxies - it limits the “over-deblending” of galax¬ 
ies which may occur at the rest-frame UV. For this reason 
we keep the i7—band source identification as our refer¬ 
ence. However, we can use the short wavelength indepen¬ 
dent measurements to flag “bona-fide” multi component 
galaxies. 

For this purpose we used the publicly availa ble photo- 
z for the HUDF galaxies from the work of iCoe et all 
(|200(3D which are based on an hybrid i-ha,nd+BVizJH 
detection. Specifically, for each galaxy in our sam¬ 
ple which displayed several clumps or plausi ble com¬ 
panions we checked in the iCoe et all (120061 ) catalog 
whether these would have been identified as separate ob¬ 
ject and their photometric redshift. Whenever also the 
z-band extraction was consistent with only a single multi¬ 
clump galaxy or the phot-z of the individually extracted 
clumps/companion was consistent within errors with the 
redshift of the given galaxy, we considered it as a true 
multi-component system at best of our knowledge. Oth¬ 
erwise we flagged this galaxy to remind us that a chance 
projection cannot be excluded in this case. We high light 
such galaxies with a red exclamation mark in Table lAll 


4.3. Limitations of the Simulations 

The MIRAGE simulations offer an ideal sample for 
testing our approach as we argued in Section [3l Nonethe¬ 
less, there are also a number of limitations of which we 
must be aware when comparing with the real data. 

First, in spite of being realistic models of typical high 
redshift galaxies, they are not statistically representative 
of the full galaxy population and especially of the relative 
frequency of normal and merging galaxies. The simula¬ 
tion snapshots are in fact almost equally divided between 
disks and mergers (and the latter dominated by major 
mergers) which is not necessarily reflecting reality. For 
this reason, we do not expect that the relative densities 
of simulated mergers and disks in the structural planes 
investigated in the following should reflect the true dis¬ 
tribution of real galaxies. We will return to this point in 
Section [HI 

Second, the predicted stellar fluxes for the MIRAGE 
simulations are not obtained with a self-consistent, full 
radiative transfer treatment and they do not include the 
effects of dust extinction and scattering which are instead 
known to be impor tant in real galaxi es. As discussed in 
detail in the work of iLotz et al.ll2008R simulated mergers 
with no dust obscuration appear more concentrated, less 
asymmetric and have lower M 20 values than simulations 
in which dust obscuration is included. The MIRAGE 
flux/77—band maps are hence likely smoother and may 
trace the stellar mass more closely than in real high- 
z galaxies. Conversely, the real mass maps may also 
appear noisier than the simulated maps as a consequence 
of the extra image processing and SED fitting which was 
required for the real data (see Section [231 )• 

Third, the lifetimes of clumps in simulated galax¬ 
ies (and hence the observed dumpiness of galaxies) 
are sensitive to t h e assumed feedback model (see e.g., 
Genel et 'aT] l2012t iMandelker et al.l I20l4 iMoodv et alJ 
20I4D . Support for the feedback recipe employed in 
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the MIRAGE suite (|Renaud et al.l[20T^ comes from the 
fact that the clumps in the simulated disks well re¬ 
produce typical stellar ages 200 Myr) of observed 
clumps l)Bournaud et al.ll2ni4al l. However, our still in¬ 
complete understanding of feedback processes could af¬ 
fect the comparison of the simulated galaxies with the 
real data. 

Fourth, the sample of MIRAGE disks consists of purely 
isolated galaxies obtained from simulations that are lack¬ 
ing a full cosmological context. A number of processes 
acting on large scales could induce a higher clumpi¬ 
ness/asymmetry in the galaxies, including tidal interac¬ 
tions with massive galaxies or nearby satellites and gas 
infall through intergalactic streams. Large scale accre¬ 
tion would help maintaining the initial high gas frac¬ 
tions (60%) for longer times than in the isolated scenario, 
leading to more prominent instability-induced features 
(clumps, asymmetry, etc.). We note however that while 
fully cosmological simulations would overcome some of 
the aforementioned limitations, the lower resolution im¬ 
posed by the large cosmological volumes would also most 
likely result in smoother galaxies than in reality. 

Finally, a further complication arises from the fact 
that the gas fraction in the MIRAGE simulations ac¬ 
counts f or both molecular and atomic hydrogen. The 
study of iBournaud et all (|2015f) showed that a substan¬ 
tial ^20% of gas in the simulations is found in the 
atomic phase in moderate-density regions between the 
clumps and in extended reservoirs. This would lead to 
an underestimation of the molecular gas fraction, and 
cons equently dumpiness, with respect to observations 
fe.g.. lDaddi et al.ll2010HTacconi et al.ll201ClH . This is wit- 
nessed by the too weak GO excitation in these models 
IjPaddi et al.ll201-3l . 

As a consequence of the above, it is reasonable to ex¬ 
pect that the simulations display a somewhat smoother 
morphology in optical imaging than the real data. We 
will keep this caveat in mind. 

5. STRUCTURAL MEASUREMENTS 
5.1. Non-parametric Morphology 

On the real as well as simulated single band images 
and the stellar mass maps, we calculated several non- 
parametric structural indicators using our own purpose 
built routines. We consider hereafter only two such in¬ 
dices: 

1. the asymmetry A, which is the normalized resid¬ 
ual flux as obtained from the difference between 
the original image and its 180°-rota ted version 
(jGonselicel 1200.11 tZamoiski et al.l 1200711 . After ex¬ 
perimenting with different methods for defining the 
center used in the asymmetry computation, we 
chose as our fiducial estimate of A the value calcu¬ 
lated with respect to the peak of the emission/mass 
distribution. We find this option to be most sen¬ 
sitive to multiple components in the galaxies and 
thus to merger features E3 

2. The normalized secon d-order moment of the 20% 
brightest pixels, M 20 (|Lotz et al.ll2004ll . describing 
the spatial distribution of bright clumps. 

Except for the multiplicity in Appendix lAls] all other param¬ 
eters are referred to the light/mass centroid. 



Figure 6. Distribution of asymmetry and M 20 in the MIRAGE 
simulations (disks=magenta hatched histograms; mergers=blue 
hatched histograms) and for the sample of real 1.5 ^ 2 ^ 3 galax¬ 
ies in the HUDF field described in Section |2.3I (gray histograms). 
The upper row shows the measurements performed on the Lf—band 
mock and real observations while those obtained on the mass maps 
are presented in the lower row. The simulated galaxies well repro¬ 
duce the distributions of structural indicators measured on the real 
HUDF galaxies. 

The choice of these two indices is justified in detail in 
Appendix lA.Ii where we also describe the other indica¬ 
tors that were explored. Briefly, we found that combi¬ 
nation of M 20 and A indices is optimal in terms of both 
the capability of separating merging from normal galax¬ 
ies and the (in)sensitivity to S/N effects. 

It is known that dust extinction and young star¬ 
forming re gions affect short wavelength morpholo¬ 
gies (e.g.. iBohlin et al. 1 I199H I Giavalis co et al.l 119961 : 
IWindhorst et al.l 120021 : iPapovich et al.l I2003lb J^or this 
reason we performed our measurements on the iJ—band 
only for the real sample of HUDF galaxies (this is 
also the reference band used in recent morpholog- 
ical studies on the GOODS / CANDELS field s , e.g ., 
iHuertas-Companv et al.l 12014 iKartalteoe et al.l 12014 . 
In the case of the MIRAGE galaxies, on the other hand, 
we computed the structural parameters also for the ar¬ 
tificial F435W and F850LP images for self-consistency 
check and comparison purposes with the iL—band and 
mass maps. We measured the indices within the SEx- 
TRACTOR Petrosian semi-major axis of each filter and 
within the Petrosian semi-major axis of the iL—band in 
the case of the mass mapsF^ 

Figure ini shows the distribution of A and M 20 measured 
on the iJ—band and mass for the real HUDF galaxy 
sample (gray histograms) in comparison with the dis¬ 
tributions that are obtained on the MIRAGE isolated 
and interacting galaxies (magenta and blue histograms, 
respectively). Although the exact shape of the observed 

T he ch oice of the Petrosian aperture is also motivated in Ap¬ 
pendix roi 
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Figure 7. Left panel block: relation between A and M 20 , for the sample of MIRAGE simulated mergers and isolated disks. From top 
to bottom and left to right, the structural indices are measured on the artificial HST F435W, F850LP, F160W images and on the mass 
map as indicated by the figure labels. Isolated disks are shown with magenta triangles, while the dark blue points correspond to the 
training sample of mergers. The light-blue symbols are mergers that are nominally in the pre-coalescence sample but have already reached 
a visually unperturbed appearance, these galaxies are excluded from the SVM training set of mergers. The dashed lines highlight the 
maximum margin classifier separation between mergers and disks obtained with the SVM approach. The solid line in the lower right panel 
indicates the classification criterion for mass-selected mergers of Equation [T] Right panel block: for the snapshots falling above the SVM 
dashed lines, we plot in the top most panel the contamination from isolated disks and in the lower three panels the completeness level for 
mergers of all mass ratios and for major mergers (ratios 1:1 and 1:2.5) or minor mergers (1:6.3) only. Green, orange and red colors show 
the values obtained when using A and M 20 measured on the artificial F850LP images, F160W images and mass maps, respectively. The 
red stars indicate the completeness in the mass-based merger selection defined by the condition in Equation [T] (shown by a solid line in left 
panels). The completeness levels are always calculated with respect to all pre-coalescence mergers (i.e., including the light blue points). 
Errors on the contamination and completeness values are calculated from Poissonian statistics. 


gray histograms reflects the relative abundance of merg¬ 
ers and normal galaxies which is not necessarily matched 
in the simulations, the MIRAGE and observed galaxies 
clearly span the same region of the structural parameter 
space. This confirms the reliability of the simulations 
in probing the morphology of z 2 star-forming galaxies 
and enables us to use them for calibrating the classifica¬ 
tion in the next Section. 

5.1.1. Uncertainties of the Structural Indices 

For the real HUDF galaxies, we estimated the errors on 
the structural indices as follows. We resampled the mass 
maps and i7—band images 101 times by replacing each 
pixel value with a new value randomly extracted within 
the mass and flux uncertainties, assuming a Gaussian 
distribution. We then recalculated the structural indices 
each time and used the median value and the 16th and 
84th percentiles of all the 101 realizations as our final 
estimates of the non-parametric indices and the associ¬ 
ated errors. Using the median over the resampled maps 
rather than the directly measured indices has the advan¬ 
tage of down-weighting the impact of isolated extreme 
pixels with large uncertainties. 

5.2. Visual Multiband and Mass Morphologies 

As complementary information to the quantitative 
structural measurements, we also visually inspected all 


galaxies in the HUDF sample. We performed the vi¬ 
sual classification independently on composite bzH im¬ 
ages and on the mass maps. Each galaxy was assigned to 
one of the following three broad classes of morphology: 

1. “Gompact galaxies”. These galaxies are either con¬ 
sistent with a spheroidal morphology with no signa¬ 
tures of perturbations or too compact for detecting 
any structural feature. 

2. “Disks”, characterized by a regular, centrally sym¬ 
metric light or mass profile for which an underlying 
disk-like morphology is discernible; 

3. “Multi-component/disturbed” galaxies, which 
present several peaks/clumps and/or lopsided 
distribution of light or mass with no clear disk or 
spheroidal morphology. 

6. CALIBRATION OF THE MASS-BASED CLASSIFICATION 

We now turn to the main goal our paper, i.e., to ver¬ 
ify whether a mass-based classification can improve the 
identification of merging galaxies. We start by testing 
and calibrating the method on the simulated MIRAGE 
galaxies. 

6.1. Comparison with Single-band Classification in the 
Simulations 
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For the MIRAGE mergers and the isolated disks, we 
show in the left panels of Figured the relations between 
M 20 and A as measured on either the three optical/NIR 
bands F435W, F850LP and F160W or on the mass maps 
(we will use the subscript “H-BAND” or “MASS” to dif¬ 
ferentiate -ff—band or mass-based indices from now on). 
The F435W results are presented for a comprehensive 
comparison with the other bands and the mass maps, 
but a separation between mergers and disks on the arti¬ 
ficial F435W images is very difficult as in this case the 
two populations clearly overlap in almost the entire pa¬ 
rameter space. Therefore, we do not consider the F435W 
band for the quantitative analysis described in the follow¬ 
ing. 

On the F850LP, F160W and mass panels of Figure [7] 
we ran a support vector machine (SVM) algorithm and 
identified the best dividing line between simulated merg¬ 
ers and disks (dashed line in the figure). We remind 
that for this calculation we excluded from the training 
sample of mergers any pre-merger galaxy, merger rem¬ 
nants and mergers which have reached an unperturbed 
state (light points in Figure [T]). Our method is hence 
optimized to select galaxies that are clearly displaying 
interaction features. For any given sample of snapshots 
falling above the dashed lines in Figure [71 we then es¬ 
timated the “contamination” and the “completeness” of 
the selected merger sample - i.e., the fraction of misclas- 
sified isolated disks among all selected snapshots and the 
fraction of all pre-coalescence merger galaxies which are 
correctly classified. The results of these calculations are 
summarised in the right panels of Figure [ 7 ] where we plot 
contamination (uppermost panel, circles) and complete¬ 
ness (lower three panels, triangles). We use a red symbol 
when the classification is performed on the mass maps, 
orange when it is based on the F160W images and green 
for the F850LP filter. 

There is a clear trend of decreasing contamination from 
clumpy disks going from the z- to i7-band and mass 
maps, supporting our approach. Although in the sim¬ 
ulations the difference between i7—band and mass is not 
statistically sig nifica nt, this trend is reinforced by the re¬ 
sults in Figure IA31 showing that the contamination is 
minimized if the classification is performed on the mass 
maps also for other combinations of structural indicators. 
We emphasize here that as a consequence of all limita¬ 
tions listed in Section IT^ the simulated i7-band images 
likely trace the mass distribution more closely than in 
real galaxies in which, e.g., patchy dust obscuration or 
higher gas fractions, will increase the galaxy dumpiness 
and worsen the ability of separating mergers from disks. 
We hence expect that the differences between the mass- 
based and flux-based classihcation should be more pro¬ 
nounced in real data. This hypothesis is supported by 
the observational results discussed in Section [71 

We also note that the mass-based selection of merg¬ 
ers reaches a completeness level that is comparable to 
the i7—band for simulated major mergers, whereas it is 
somewhat lower for the minor mergers. This is not sur¬ 
prising since, by definition, the mass-based classification 
identifies galaxies with large mass contrasts/asymmetries 
and hence is less sensitive to minor mergers. In the sim¬ 
ulations, some minor mergers can still be detected in the 
optical/NIR bands as a consequence of SFR enhance¬ 
ments. Again, in real observations the classification of 


these minor mergers will be further complicated by the 
effects discussed above. 

6.2. Quantitative Definition of Mergers 

We use the results of the SVM partition to define a 
quantitative criteria for selecting mergers based on the 
position in the M 20 . MASS and Amass plane. We con¬ 
cluded in Section I 473 I that an exact one-to-one match be¬ 
tween the relative numbers of disks and mergers in the 
simulations and the real data is not expected. For this 
reason we have refrained from performing any fine-tuning 
of the coefficients in the relation between M 2 o,mass and 
Amass such as to optimize the completeness versus con¬ 
tamination level. This would in fact depend on the input 
merger fraction in the simulations. We rather follow the 
conservative approach of identifying a locus in the mass- 
derived M2o,mass versus Amass plane which - unless 
real galaxies have extreme mass distributions that are 
not reproduced by current models - should be populated 
by mergers only. This is of course still model dependent, 
but less affected by the specific choice of the merger frac¬ 
tion in the simulations. 

Our simulation-justified dividing line between mergers 
and normal galaxies is hence given by the following rela¬ 
tion: 

Af2o,MASs > -lA X Amass — 1A2, (I) 

and it is obtained by simply applying a shift to the best 
SVM line in the M2o,mass versus Amass plane of Figure 
jTjsuch to exclude the most extreme disks. This relation 
is shown with a solid line in Figure [T] 

The completeness level over all pre-coalescence merger 
snapshots (including the unperturbed light-blue points 
in Figure [7]) reached with the selection of Equation [T] 
is shown with a star symbol in the lower-right panel in 
Figure [71 As a consequence of our rather conservative 
choice of the dividing line in Equation [TJ the mass-based 
classification results in a very high purity of the selected 
mergers at the expense of the completeness of the merger 
sample. On the MIRAGE simulated data, we estimate 
a completeness of up to ^ 40% for major mergers (ratios 
I:I to 1:2.5) and up to 10% completeness for the minor 
mergers when applying Equation [TJ 

When dealing with real data, it is useful to introduce, 
together with the binomial classification of Equation [TJ 
also a more probabilistic description of the M 2 o,mass ver¬ 
sus Amass plane which could be used to assign a merger 
likelihood on individual galaxies. For this reason using 
the results from the simulations, we also derived a contin¬ 
uous parametrization of the contamination from clumpy 
disks over that plane. This was obtained by modeling 
the number densities of the mergers and disk snapshots 
with a Gaussian mixture approach and calculating the 
fraction of isolated disks over all snapshots in any given 
region. As we discussed in detail in Section IT31 the rel¬ 
ative number of mergers and disks in the simulation is 
almost 1:1, whereas the major merger fractions reported 
in the literature at the redshift considered here are of or¬ 
der of 10% (e.g.,lBluck et al.ll2009l: iNewman et al.l[20I2l : 
iWilliams et al.ll20IlHMan et al.l 20I4[l . To derive a real¬ 

istic value of the relative abundance of disk and merger 
snapshots in M2o,mass versus Amass space, we hence 
rescaled the number counts of simulated mergers such 
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Figure 8. Left: location of real galaxies with known kinematic classification on the M2 o,mass Amass plane, in comparison with the 
results from the MIRAGE simulations. The simulated isolated disks are shown as magenta triangles and the blue symbols are simulated 
mergers, as in Figure [7| (light blue corresponds to pre-coalescence mergers with already a regular, unperturbed appearance). Real observed 
galaxies that have publicly available kinematic classification are plotted as large points with error bars: in black for kinematically confirmed 
disks and in green for the known merger. The sample of observed galaxies with kinematic informat ion is composed by the subset SINS 

S ’ ixies with kinemetry classification in the GOODS-S area and by the clumpy disk in the study of IBournaud et al.l 1|20Q8I') (see Section 
for details.) Right: as on the left panel, but now the color-coding provides the level of contamination from isolated disks (i.e., the 
fraction of disk snapshots over all snapshots) in any given area of the M 20 MASS ^MASS- The shading is obtained by modeling the 
disk and merger populations with a gaussian mixture and assuming a total merger fraction of 10% (see text in Section lOli . The division 
into mergers and disk galaxies based on the simulations and Equation ^ (dotted line) is in very good agreement with the position of real 
kinematically confirmed disks and mergers. 


as to reach a total merger fraction of 1O%0 We show 
this probabilistic mapping of disk contamination in the 
-M20,mass versus Amass plane in the right-hand panel 
of Figure |H1 This information can be used to refine the 
classification on a case by case analysis. 

6.3. Validation on Real Galaxies with Kinematie 
Classification 

A kinematic classihcations into rotationally supported 
disk or merg er is currently available for some z > 1 
galaxies le.g.. lEo inat et a Lll2009l : IBournaud et al.l[20081 
iForster Schreiber et al.ll2009f) which can thus be used to 
further validate the calibration of the classification in the 
Af20,MASS versus Amass plane obtained from the MF 
RAGE simulations. For this test we consider galaxies 
with a similar optical-|-NIR wavelength coverage as our 
HUDF sample. 

In pa rticular, a subset of the gal axies in the SINS 
survey (|F6rster Schreiber et al.l[200^ are located in the 
GOODS-S field and thus stellar mass maps can be 
built using the GOODS + CANDELS data. Specifically, 
we refer to those SINS galaxies which have a kineme- 
try classihcation in Table 9 of IForster Schreiber et al.l 

This is simply done by down-weighting the merger counts by 
the factor Wm = (fm x Nd)/({1 - fm) x Nm) such that fm = 0.1 = 
Wm X + Wm X Nm) ■ In the above, = 96 and Nm = 88 

are the original number of simulated disks and mergers and fm is 
our target fraction of 10%. 


(|2009ll . There are three such galaxies falling in the CAN- 
DELS/GOODS field: K20-ID6, K20-ID7, K 20-ID8 which 
were o riginally extracted from the sample of iDaddi et al.l 
I2004ai K20-ID6 and K20-ID8 are classihed as disks, 

while the nature of K20-ID7 is less clear as it has 
been classified as merger by the kinemetry analysis in 
IForster Schreiber et ahl (1200911 or as a dis k displaying dis- 
turbed rotation in iTacchella et al.l (I20I5I1 . We label it as 
“perturbed/merger” to indicate that it is not a regular 
disk. For another galaxy that is included in our sample 
of galaxies with HUDF + HUDF12 coverage (GANDELS 
ID 15011 or UDF ID 6462) a kinematic analysis based 
on SINFONI data has been published in the study of 
IBournaud et al.l (|2008f) . As discussed in that work, in 
spite of its disturbed appearance this galaxy clearly dis¬ 
plays a rotational motion in the Ha velocity held. We 
hence include this object in the sample of kinematically 
classihed disks. We performed our mass map analysis 
on these galaxies and compare in Figured their location 
with the expectations from the simulations. 

Inspecting Figure [SJ we hnd a very good agreement 
between our simulation-based partition of the M 2 o,mass 
versus Amass plane and the location of the observed 
galaxies: all kinematically conhrmed disks lie below the 
line dehned in Equation [T] in the disk-dominated locus 
and the kinematic merger/perturbed galaxy is on top of 
the merger locus, on a region where the disk contamina¬ 
tion is expected to be < 20% at the HUDF depth. 
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Figure 9. Evolution of the Asymmetry and M 20 indices in the MIRAGE simulated disks and mergers. From left to right the four columns 
show the results obtained from the mock observations in the F435W, F850LP, F160W filters and on the mass maps. The upper-most, 
isolated panel shows for reference the SFR histories of the simulated galaxies. Merger simulations are shown as blue lines of increasingly 
darker shades with increasing merger ratio (see figure legend) while the magenta lines correspond to the isolated disks. The x-axis is the 
simulation time with respect to the coalescence time (shown with thin dotted line a f = 0), negative times hence indicate an on-going 
merger while for positive t values the two galaxies have already merged into a single object. No coalescence time exists for the isolated 
disks which are simply shown over the time range considered. Note that the pericenter time for simulated mergers is at about t = —200. 
The shaded gray area in the right panels indicates the time scale over which the simulated mergers satisfy the condition in Equation ^ 
between the M2 o,mass Amass indices and thus would be classified as mergers according to that criterion. 


We note that the real kinematically confirmed disks 
display somewhat higher Amass o,nd M 2 o,mass values 
than the bulk of simulated MIRAGE disks. As men¬ 
tioned in Section 01 a lower gas fraction in the simu¬ 
lations with respect to real galaxies or the lack of dust 
could be responsible for the difference. Furthermore, we 
discuss in Section 17.11 that a lower S/N causes a shift to 
higher M 20 . Small modifications to Equation[T]should be 
applied when using data that is shallower than HUDF, 
as for the SINS galaxies. When accounting for this effect, 
we still find a good agreement between the kinematic and 
mass classification, with galaxies K20-ID6, K20-ID8 and 
ID 15011 falling in the disk-dominated region and the 
perturbed galaxy K20-ID7 on the transition region be¬ 
tween disks and mergers, consistently with its ambiguous 
kinematic classification. 

6.4. Probed Timescales and Merger Ratios 

We can also use the simulations to convert the struc¬ 
tural selection into a typical observability window during 
which mergers are identified as such by the mass-based 


classification. Figured shows the evolution in time of the 
structural parameters in the MIRAGE simulated disks 
and mergers. As a further illustration of the results of 
Section 16.11 we present the time variations for indices 
measured on the stellar mass maps as well as for those 
derived on the three optical/NIR bands. The different 
shades of blue illustrate the evolution of mergers with 
different mass ratios (increasing merger ratio for darker 
shades), whereas the magenta lines are for the isolated 
disks. For interacting galaxies we trace the parameters 
from an early merger phase at 50Myr before pericenter - 
when the two galaxies are still separated and hence the 
structural indices are measured on the primary only - 
to after the complete fusion between the two galaxies. 
The peak observed in most of the indices coincides with 
the pre-coalescence phase that we are targeting with our 
selection; clearly it becomes more distinguishable from 
other phases when moving from the optical to the NIR 
and the mass maps. 

The shaded gray areas in the right panels of the figure 
highlight the time interval over which 90% of the merg- 
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Figure 10. M 20 versus asymmetry plane for the sample of 87 1.5^ z ^3 galaxies in the HUDF area with reliable mass maps. In the left 
panels we present the structural indices measured on the band images while on the right panels are those obtained on the stellar mass 
maps. From top to bottom measurements performed on images of increasingly shallower depth are presented, as highlighted by the figure 
legend. The deepest HUDF + HUDF12 are available only for 50 of the 87 galaxies; the points in the upper-most panels are hence a sub-set 
of thos e in t he other planes. The different symbols correspond to the broad visual morphological classes in which the sample is divided in 
Section l5. 21 triangles = disk galaxies; circles = compact/smooth galaxies; lemniscate = multi-component/disturbed galaxies. Grey symbols 
highlight the two galaxies for which the mass map reconstruction is uncertain due to large contamination from companion galaxies. The 
dotted lines indicate the relation M2 o,mass > —1.1 x Amass — 1-12 dividing simulated mergers and disk galaxies in Equation^ The 
dashed lines are the depth-adjusted criteria of Equation (2] In the right-hand panels, the background shaded areas reproduce the disk 
contamination fraction presented in Figure[8]for simulations matched to the HUDF -hHUDF12 depth. As illustrated by the several orange 
triangles in the left panels, merger samples selected using Ah— band ^-nd M2q^h-BAND cuts are substantially contaminated by visually 
clumpy galaxies that however have smooth, disk-like mass profiles. 
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Figure 11. Composite bzH image, ff—band stamp and mass map (from left to right) for galaxies which have a sufficiently asymmetric 
appearance in the band to be classified as mergers on the basis of their Ah— band and M20,h—BAND values, but are instead visually 
and quantitatively classified as normal, disk-like galaxies based on their mass distribution and the -Amass and M 20 MASS indices - see filled 
orange triangles above the dotted line in the lower left panel of Figure 1101 We mark the ff-band stamps with “C” or “H12” depending 
on whether the galaxy satisfies the criteria for being a merger when using the CANDELS-Deep //—band photometry or the HUDF12 
images (or both). A sign highlights those galaxies for which only CANDELS observations are available. All images are 3" wide. When 
available, the stamps and mass maps from the HUDF/HUDF12 imaging are presented; HUDF/CANDELS-Deep images are shown in the 
other cases. 
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ers identified by the selection in Equation [T] are found: 
following our classification criteria we are able to iden¬ 
tify mergers for about 200 Myr, i.e. between 300 Myr and 
50Myr before coalescence. This time scale is consis tent 
with those reported by, e.g.. lLotz et al.l ()2008bl [2010(1 for 
optically selected mergers, although these authors find 
a wide range of observability time-scales (between ~0.1 
and 1 Gyr) depending on the orientation, type of mergers 
as well as on the structural indicators used to detect the 
merger features. 

Finally, in the top panel of Figure |9] we show for com¬ 
parison the evolution o f the SFR in the si mulated galax¬ 
ies (see also Figure 7 of iPerret et al.l[20Tll . Interestingly, 
the variation in the structural parameters occurring dur¬ 
ing the merger event is not associ ated to a cle ar SFR 
enhancement in the simulations. iPerret et al.l (120141 1 
list several plausible causes for the lack of the starburst 
phase, including the interplay between the high gas frac¬ 
tions and the feedback treatment in the simulations. A 
detailed study of the star formation histories in the MI¬ 
RAGE simulations is beyond the scope of this paper (and 
is also affected by uncertainty in the models), the infor¬ 
mation is presented for completeness here but we will 
not further speculate on the links (or absence thereof) 
between the structural properties and star formation ac¬ 
tivity of the simulations. 

7. MASS VERSUS LIGHT: CLASSIFICATION FOR REAL 
HUDF GALAXIES AT DIFFERENT DEPTHS 

Building on the results from the previous section, we 
now apply the mass-based classification and compare 
the outcome with the results from the 77—band im¬ 
ages for the 87 galaxies with reliable mass maps in the 
HUDF area (i.e., the 89 galaxies with SExtractor kron 
radii larger than 5 x the PSF FWHM and magnitudes 
77 ^24.5 minus the two galaxies with mass maps affected 
by strong contamination from neighboring galaxies, see 
Section 12.51) . 

For the galaxies in the HUDF area, the availability of 
multi-depth data allows us to verify the reliability of the 
mass-based classification at different S/N. Specifically, 
we consider mass maps derived from the three combi¬ 
nations of optical plus NIR imaging described in Sec¬ 
tion 12.41 (I) GOODS and CANDELS-Deep data, (2) 
HUDF plus CANDELS-Deep, and (3) the deepest maps 
obtained from the HUDFI2-I-HUDF observations. Of 
the 87 galaxies here considered, 50 galaxies have cov¬ 
erage at all depths, while for the other 37 only the first 
two versions of the mass maps could be derived. For 
comparison with the mass maps results, we also consider 
structural parameters measured on the 77—band images 
at the HUDF12 or CANDELS-Deep depths. 

In Figure [10] we present the location of the HUDF 
galaxy sample in the M 20 versus Asymmetry plane, ei¬ 
ther derived from the 77—band images (left panels) or the 
stellar mass maps (right panels). From top to bottom we 
show measurements performed on images of increasingly 
shallower depths. The dotted lines in the figure indi¬ 
cate the dividing locus of Equation [T] and t he d ifferent 
symbols highlight the visual types of Section 15.21 In the 
following we will regard galaxies with a visual class for 
the mass maps that is inconsistent with the quantitative 
criterion on Amass and A72 o,mass as failures of the mass- 
based classification, i.e., as contaminants in the merger 


sample. In some cases such galaxies display genuine color 
gradients which could justify the mass asymmetries, but 
we will take the conservative approach of assuming they 
are the result of noise in the mass maps0 

7.1. (Falsely) H—band Selected Mergers 

We start the analysis on the real galaxies by identify¬ 
ing in Figure Uni those galaxies which would be classified 
as mergers based on the irregularity of their 77—band 
images. For a consistent comparison with the selection 
based on the mass map, we consider as mergers can¬ 
didates those galaxies that satisfy Equation |T] also for 
the 77—band measurements, i.e., galaxies above the dot¬ 
ted lines in Figure [101 Other criteria are often used in 
the literature when using M 20 and asymmetries derived 
from optical images. At low redshift (z< 1) a cut at 
A > 0.3-0.35 is typically applied to select major mergers 
(|Conselicell2003l : iLotz et al.ll2008bll but for z > 1.5 galax¬ 
ies a value of A > 0.2 is preferred to account for the effects 
of the morphological fc-correction, the decrease in the im¬ 
age r e solution and surface b rightness dimming (jConselicel 
120031 iConselice et al.l 120051) The distribut i on cA irregu¬ 
lar galaxies in Figure 10 of iScarlata et al.l (|2007f) would 
also suggest a division at M2o,h-band > -^-7 to iden¬ 
tify mergers candidates. We have checked (and it can 
be verified by inspecting Figure ITOl) that the result here 
presented would not change if these other criteria are 
applied. 

More than 90% of the galaxies that satisfy Equation 
[T] in the 77—band were assigned to the class of “multi¬ 
component/disturbed” galaxies also by the visual bzH 
morphological analysis. This means that these galax¬ 
ies would be classified as mergers by both a multi¬ 
wavelength visual inspection and a quantitative struc¬ 
tural analysis on the 77—band. Gonversely, it can be 
seen that several of the merger candidates lying above 
the dotted line in the 77—band planes are visually clas- 
sihed as normal disk galaxies based on their mass map 
appearance (orange filled triangles in the left panels of 
Figure [To)) . These galaxies have accordingly low asym¬ 
metry and M 20 indices measured on the mass map (see 
right panels of Figure [TU]). At both the HUDF 12 and 
CANDELS-Deep depth, about ~ 50% of those galaxies 
with large enough Ah-band and M2o,h-band to be 
classified as mergers in the 77—band have a mass pro- 
hle consistent with that of a normal disk galaxy and fall 
below Equation [T] when the mass-based indices are used. 
To illustrate these differences between mass and 77—band 
classification, we present in Figure [TT] the stamp images 
for those galaxies with Ah-band and M2o,h-band as 
high as those of mergers (using either the HUDF12 or 
CANDELS photometry) but a disk-like mass profile. In 
spite of displaying composite bzH and 77—band mor¬ 
phologies dominated by multiple clumps and asymmetric 
light distribution which could be suggestive of a merger 
event, all such galaxies have a regular, centrally concen¬ 
trated distribution of mass, typically associated with a 
red nucleus. 

Note that we also implicitly assume here that there are no 
systematic biases in the derivation of the mass maps, which could 
generate artificial multi-component, merger-like features in high 
S/N maps. 
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Mass-based Classification w.r.t. HUDF+HUDF12 mass maps 
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Figure 12. Top: comparison between the classification based on Amass ^20, MASS calculated on the stellar mass maps constructed 

with deep HUDF H-HUDF12 photometry and the classifications which are instead obtained from mass maps derived from the shallower 
combination of HUDF + CANDELS-Deep (left) or GOODS + CANDELS-Deep imaging (right). Eor galaxies classified as either non¬ 
interacting or mergers according to Equation [2 when using the HUDF + CANDELS or GOODS + CANDELS data, we show: with a dark 
histogram the fraction that have a consistent classification at the HUDF -|- HUDF12 depth and with a light histogram the fraction that is 
misclassified at the shallower depths, namely the fraction of HUDF + CANDELS-Deep or GOODS + CANDELS-Deep merger candidates 
that are instead classified as disks from the HUDF -I-HUDF12 maps, and vice versa. For each class, the light and dark histograms sum 
up to one. White error bars indicate Poissonian uncertainties on the measured fractions. Bottom: As above but for the classification 
performed on the iL—band images. In particular, we plot the fraction of galaxies classified as mergers or disks based on the Ah— band and 
A^20,H—BAND measured on the HUDF12 (left) or CANDELS-Deep images (right) that have a consistent or a different classification when 
using Amass and M 20 mass from the HUDF -I-HUDF12 mass maps. At any depth, the classification based on measurements performed 
on the H—band results in a twice as large contamination of clumpy galaxies in the mergers sample than the mass-based classification. 


7 . 2 . Mass-selected Mergers 

Focusing on the right panels of Figure [TUI we can in¬ 
stead study the results of the measurements performed 
on the mass maps. At the HUDF -I-HUDF12 depth - 
top most panel - we find an excellent agreement be¬ 
tween the locus of mergers expected from the analysis 
on the simulations (see light shaded area and dotted 
line) and the visual classes for the real data. A compar¬ 
ison with the other right-hand panels, however, reveals 
some important facts: not surprisingly, the separation 
between galaxies visually classified as disks/compact and 
galaxies with visually disturbed morphologies becomes 
less clear as the S/N of the mass maps decreases. At 
the GOODS -I- CANDELS-Deep depth, the distribution 
of galaxies in the Amass and M 2 o,mass plane is notice¬ 
ably more clustered than for the structural parameters 
derived on the HUDF -I-HUDF12 mass maps. 

It also appears rather clearly from Figure fTOl that the 


bulk of real galaxies shifts toward higher M 2 o,mass val¬ 
ues as a consequence of the decrease in S/N. This is 
particularly evident when comparing the data points 
with the shaded locu s def ined by the simulations. We 
showed in Appendix IA.3I that, while the Asymmetry 
is almost insensitive to the image noise, some depen¬ 
dence of the M 20 on the S/N is expected and variations 
of the order of 10% are measured for M 20 indices de¬ 
rived at different depths. This suggests that the crite¬ 
rion in Equation [U which was derived for simulations 
reproducing the HUDF -I-HUDF12 depth, should be re¬ 
vised as an “adaptive” threshold varying with the im¬ 
ages S/N. Furthermore, we note that at the shallower 
depth a few merger candidates appear in a region of very 
low asymmetry Mmass < 0.1) but very high M 2 o,mass 
(A r 2 o,MASS > ~1) where virtually no simulated mergers 
nor galaxy visually classified as merger are found. We 
suspect these galaxies to be affected by measurement er- 
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rors. 

We use the above empirical findings to provide refined 
criteria to robustly select mergers at the various depths: 

r-1.12 H + H12 

-M20,MASS > —1-1 X ^MASS + < —1-00 H + C 

[-0.98 G + C 

^MASS > 0.1 , (2) 

where “H-fH12”, “H-hC” and “G-^C” stand for 
the mass maps derived from the HUDF-|-HUDF12, 
HUDFCANDELS-Deep and GOODSCANDELS- 
Deep photometry, respectively. These new relations 
where obtained by adding to the locus of Equati on [T ] 
the average offsets in M 2 o,mass measured in Figure EH 
The new selection limits are shown with dashed lines in 
Figure [TOl 

By selecting as merger candidates those galaxies 
that satisfy the generic Equations [T] we find a total 
of 9, 21 and 26 mergers for the HUDF-|-HUDF12, 
HUDFCANDELS-Deep and GOODS-f CANDELS- 
Deep combinations, respectively. Of these 2, 6 and 10 
have a visual class that is inconsistent with the quanti¬ 
tative analysis on the mass maps, i.e they have been as¬ 
signed an either “compact” or “disk” class in the visual 
inspection of the mass maps. The use of the fixed selec¬ 
tion of Equation [T] would hence result in an increase of 
the contamination from 20% for the HUDF -|- HUDF12 
maps to 40% for the GOODS -|- GANDELS-Deep combi¬ 
nation. Conversely, using the relations in Equation [H we 
find 9, 15 and 15 mergers candidates with a total of 2, 
3 and 4 galaxies visually classified as non interacting, 
namely a roughly constant contamination of 20%, re¬ 
gardless of the image depth. 

What should be stressed here is that applying the 
‘adaptive’ criteria of Equation [H also to the H-Band 
would not reduce the contamination of clumpy disks: 
as it is clear from the bottom left panel of Figure [TOl 
these clumpy disks are distributed everywhere in the 
M 20 , H-BAND and Ah-band plane and there is no selec¬ 
tion that would substantially reduce the contamination. 

As a validation of the latter statement we considered 
the 50 galaxies for which all three versions of the mass 
maps are available and, for each combination of the pho¬ 
tometric data, we independently identified the samples 
of non-interacting galaxies and mergers, even when ac¬ 
counting for errors (i.e. we selected as merger candidates 
those galaxies that have error bars above the relations 
in Equation (2). We did so for both the H-band and 
mass structural indices. We then assumed that the clas¬ 
sification performed using the deep HUDF-|-HUDF12 
mass maps is the “correct” one and we calculated for 
the mergers and disks identified at the other depths or 
in the Ff—band the fraction that have a consistent clas¬ 
sification - i.e. mergers or non-interacting galaxies clas¬ 
sified as such also on the deep mass maps - and the 
fraction of galaxies which are instead misclassified as a 
consequence of the lower S/N - i.e. galaxies classified as 
mergers that are identified as not-merging on the deep 
HUDF-I-HUDF12 maps or iF—band images, and vice 
versa. 

We show the results of this calculation in Figure 
m The number of identified mergers is small and 


thus the error bars relatively large, however the figure 
shows that by applying the relations in Equation [2] the 
fraction of “misclassified mergers” in the mass selected 
sample is consistent with being ~ 20% for mass maps 
obtained from both the HUDF -|- GANDELS-Deep and 
GOODS -I- GANDELS-Deep combinations. Gonversely, 
for the structural indices Ah-band and M2o,h-band 
the contamination of false mergers remains as high as 
^ 50% for both the HUDF and GANDELS observations, 
even when imposing the condition in Equation[2j As wit¬ 
nessed by the increase in the fraction of galaxies that are 
misclassified as disks in Figure [121 the drawback of the 
refined selection is that a larger number of mergers are 
missed from the selection at the shallower depths. 

Summarizing the above findings, the availability of 
deep photometry is certainly necessary for maximizing 
the accuracy and completeness of the mass based classi¬ 
fication. Nonetheless, applying small corrections to our 
simulation-motivated selection of mergers reliable merg¬ 
ers samples can be identified also on medium depth im¬ 
ages, albeit reaching a lower completeness. Most no¬ 
tably, independently of whether Equations [T] or the re¬ 
fined relation in [2] are applied to the Ah-band and 
M 2 o,h-band indices, we find the iF—band classification 
results in a twice as large contamination of clumpy disks 
in the merger sample at any depth here considered. 

We identify our final sample of merger candidates 
among the HUDF sample as those galaxies that satisfy 
the selection criteria in Equation [2] even when account¬ 
ing for the error in the measurements (i.e., have error 
bars above those relations). The classification is per¬ 
formed on the deep HUDF -|-HUDF 12 mass maps when 
available and on the HUDF -|- GANDELS-Deep maps for 
those galaxies with no HUDF12 coverage. We find a total 
of 11 such candidates over the 87 galaxies with reliable 
mass maps. For the latter, we show in Figure HH the 
bzH, iF—band and mass stamps. Among this sample of 
mergers are included one galaxy with an inconsistent vi¬ 
sual class and also one object that has been flagged as 
possible chance projection in Section [421 (ID 15844, but 
note that even excluding the secondary blue clump this 
galaxy would still be classified as a merger), suggesting 
again a 20% contamination level. 

7.3. Mergers Missed in the H—band 

Thus far we have focused on those galaxies which are 
misclassified as mergers in the FF—band because of the 
presence of bright clumps. It is however interesting to 
test whether also the opposite occurs, i.e., whether there 
are galaxies classified as mergers in the structural anal¬ 
ysis performed on the mass maps that instead appear 
smooth(er) in the FF—band, for example as a result of 
dust lanes or intrinsically high mass-to-light ratios. Fig¬ 
ure [12] suggests that this happens for roughly 10% of the 
“FF-band smooth” galaxies. In our sample of 11 mass- 
selected merger candidates we find five galaxies (IDs 
9704, 11388, 12624, 14533 and 15432) that fall below the 
dotted lines in Figure [8| when using M 2 o,h-band and 
Ah-band and hence would not be classified as mergers 
in the FF—band. 

To understand the origin of this discrepancy, we gen¬ 
erated new versions of the mass maps for these galax¬ 
ies obtained by applying no dust extinction during the 
pixel-by-pixel SED fitting. This gives us an indication 
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Figure 13. As in Figure ITTI but in this case showing galaxies which satisfying the relations in Equation [2] even when accounting for errors 
in the measurements and hence are candidate mergers according to our definition. 


of whether the mass asymmetry is driven by the dust 
correction applied to the templates. IDs 9704 and 14533 
display significant asymmetries also in these mass maps, 
indicating that the classification is independent of the 
extinction law assumed. IDs 12624, 11388 and 15432 in¬ 
stead would not satisfy our selection criteria with dust- 
free mass maps. We note that both IDs 12624 and 
15432 are detected in the FIR with total IR luminosities 
Lm > IO^^Lq, implying a substantial amount of dust 
in these galaxies leading us to favor the results obtained 
including dust corrections. 

While it is clear that additional, e.g., kinematic, in¬ 
formation is required for a complete characterization of 
the population of “mass-smooth and H-clumpy” galax¬ 
ies, the available data suggest that most of these galaxies 
have distinct substructure and that dust obscuration can 
explain some of the differences between the i7-band and 
mass morphology. 


8. SUMMARY AND CONCLUSIONS 

Using a sample of about 100 HUDF galaxies with 
1.5 ^ z ^ 3, we have compared a morphological clas¬ 
sification of merging galaxies based on non parametric 
structural indices derived on resolved stellar mass maps 
with a canonical classification obtained from iJ—band 
measurements. We tested this approach using photome¬ 
try at the different depths available on the HUDF area 
(GOODS, CANDELS and HUDF/HUDF12) and, per¬ 
forming tests on artihcial stellar mass distributions, we 
derived luminosity and size limits for which reliable mass 
maps can be obtained. The selection of mergers in the 
mass domain is calibrated using a sample of isolated 
and merging galaxies from the MIRAGE hydrodynam- 
ical simulations which have been post-processed and an¬ 
alyzed to closely reproduce the observational data. 

We summarize our findings as follows: 


1. as also discussed in previous works (|Wuvts et al.l 
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I2012D . the stellar mass maps morphologies cannot 
be reproduced by i/—band data alone which of¬ 
ten display merger-like features even for galaxies 
with perfectly smooth, disk-like mass profiles. Al¬ 
though some of these galaxies display a red nucleus 
indicative of a central mass concentration, even a 
combined visual inspection of multiple bands is still 
affected by the presence of star-forming clumps and 
does not provide a full proxy for the mass distri¬ 
bution. As a consequence of the above, we quan¬ 
tify that merger samples identified on the basis of 
asymmetry/irregularity in the i/—band images can 
have a contamination from clumpy galaxies as high 
as 50%. Our analysis also suggests that differences 
between the mass- and i7-band selection may orig¬ 
inate from a population of galaxies with smoother 
i/-band images than mass distribution, possibly as 
a result of dust extinction. 

2. On both the data and the simulations, we find 
that a combination of Amass and M2o,mass rnea- 
sured on the stellar mass maps is instead most 
cleanly separating major mergers from isolated, 
clumpy galaxies. We use the results from the 
simulations and the analysis on the real HUDF 
galaxies to provide in Equations [T] and [5] quanti¬ 
tative criteria to separate the galaxy population 
into mergers and disks using mass maps derived at 
the HUDF-HHUDF12, HUDF-h CANDELS-Deep 
and GOODS -I- CANDELS-Deep depths. Applying 
these criteria, we estimate that chance projections 
or the scattering of clumpy disks in the merger sam¬ 
ple by noise in the mass maps result in a contam¬ 
ination of roughly 20%. When applied to galax¬ 
ies with available kinematic data, our Amass and 
Af 20 ,MASS based classification results in morpholo¬ 
gies that are consistent with the kinematic classes. 

3. The ability of the mass-based Amass and M2o,mass 
indices to select true mergers is not significantly 
affected by a moderate decrease in the S/N of the 
parent images if the conditions in Equation [5] are 
used. Gonversely, the iJ-band classification re¬ 
sult in a twice as large contamination from clumpy 
disks, independently of the criteria that are ap¬ 
plied. 

4. Erom the analysis of the MIRAGE simulated merg¬ 
ers we estimate the proposed Amass: Af2o,MASS se¬ 
lection should be sensitive to major mergers be¬ 
tween 300 Myr and 50 Myr before coalescence. 

Based on the above results, we thus suggest that to 
identify major mergers a classification in the mass do¬ 
main rather than from optical/NIR images should be pre¬ 
ferred and performed whenever possible. Obviously, our 
technique is only sensitive to merger phases in which the 
perturbation in the mass profiles is measurable (roughly 
up to 300 Myr before coalescence as specified above) and, 
as also a consequence of imposing a high purity in the 
resulting merger sample, is strongly biased against ear¬ 
lier merger stages. A combination with other techniques 
(e.g., close kinematic pair selection) would be hence nec¬ 
essary for a full census of merging systems. 


Einally, it is possible that variations in the gas fraction 
of galaxies and the actual masses of giant clumps could 
introduce some redshift dependent scaling of the thresh¬ 
old here derived, moving the locus of clumpy disks toward 
slightly higher Amass and/or M2o,mass values at higher 
redshift. However, we expect that these variations would 
be comparable to the uncertainties in the mass maps and 
the derived structural parameters and hence would not 
strongly affect the proposed classification for sufficiently 
large samples. 
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APPENDIX 

A. RELIABILITY OF THE MASS MAPS AND DERIVED PARAMETERS 
A.l. Comparison with Integrated Masses 

As a basic consistency check of the mass maps for the sample described in Section 12.31 we show in Figure lAll the 
comparison between the sum of the masses in each individual pixel of the galaxy and the total galaxy mass that is 
obtained from the integrated photometry. Although for highly obscured or strongly star-forming galaxies differences 
between the two estimates can be expected on an object by object basis due to the patchy distribution of dust, strong 
disagreement between the resolved and integrated estimates would be an indication of biases in the pixel-by-pixel SED 
fits. We instead find a very good agreement between the pixel-based total mass and the whole galaxy mass, with a 
median difference which is less than 0.1 dex. 



Io§lo(^global-galaxy/^o) 


Figure Al. For the 132 galaxies with 1.5 ^2^3 presented in Section 12.31 we show the comparison between the galaxy mass obtained 
from SED fitting to the integrated photometry {Mgichal—galaxy) and the sum of the masses in each individual pixel in the mass maps 
(t^pixels)- On the top right corner of the Figure we provide the median difference between the two estimates for the entire sample of 
galaxies here considered. 


A. 2 . Description of the Models used for Testing the Mass Reconstruction 

As mentioned in Section r2.51 the resolution and noise of the parent observations set major limitations for the estimate 
of the stellar mass maps. In order to derive the size and magnitude limits for which a reliable mass map can be obtained, 
we tested the pixel-based SED fitting on a sample of artificial galaxies with known mass distribution. To perform the 
test on model galaxies which are representative of the typical real galaxies, the models were drawn from the observed 
sample as follows. 

For each real HUDF galaxy in the initial sample presented in Section 12.31 we generated a toy mass distribution 
characterized by a Sersic profile with a total mass equal to t he galaxy integrated mass and structural parameters 
determined by the best-fit i/—band GALFIT (iPeng et al.ll201fl(l model. To densely populate the magnitude-size plane, 
we also generated extra models by adding random perturbations on the original galaxies parameters. We then imposed 
each pixel in the artificial galaxy to have an SED equal to the observed best-fit SED for the entire galaxy (including 
a uniform extinction for all pixels) and used this SED to predict the pixel fluxes. 

From these flux maps we generated artificial HST images which we pasted into blank sky regions to mimic the typical 
S/N properties reached by the real galaxies with HUDF optical plus GANDELS-Deep NIR coverage. This depth is 
in between the two extreme combination of imaging (deep HUDE -I-HUDE12 or shallow GOODS -I- GANDELS-Deep) 
used for creating the mass maps (see Section [TT]). Specifically, we created stamps at the HUDF depth for the F435W, 
F606W, F775W and F850LP filters, while the F814W, F105W, F125W and F160W images were matched to the 
GANDELS-Deep depth. All images were degraded to the resolution of the E160W filter as in the real sample. 

We finally computed “observed” mass maps form these artificial images as described in Section [2.41 i.e., by running 
Adaptsmooth, extracting pixel-based SEDs and fitting them with LePhare and used these “observed” mass maps 
for our tests in Section [2.51 Although these simulations are undoubtedly a simplification of reality which is complicated 
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by pixel-by-pixel variations of the SED, dust content, etc. they provide us with a measure of systematic biases in the 
mass estimation. 

A.3. The Choice of Classifieation Parameters: Diseriminating Power and S/N Ejfeets 

Together with the M 20 and A indices described in Section [5Tl we also evaluated the following structural indicators: 

1. The concentration C defined as the logmithmic ratio o f the radii containing 20% and 80% of the total flux 
C = 5 Iog(r 8 o/'r 2 o) (|Bershadv et al.l[2000t lConselicell2003ll . 

2. The Gini coefficient (jAbraham et al.l[2003l : iLotz et al.ll2004D . which describes the uniformity of the flux distribu¬ 
tion on a scale between 0 (all pixels with equal flux) and 1 (all flux in just one pixel). 

3. And finally the multiplicity ip that quantifies the presence of multiple components through a comparison between 
the original im age and a resamp led version in which the pixels are re-arranged in decreasing flux order from the 
br ightest pixel (ILaw et al.ll2007f l. For this latter parameter we slightly modified the original definition proposed 
in lLaw et al.l (|2007l l to account for ellipsoidal light/mass distribution. 

We present in Figure [551 the analogous of Figure [7] but this time considering planes that include also C, Gini and ip. 
On one hand, the figure reinforces the results of Sections [6] and [T] even for these different combinations of structural 
indicators, the contamination in a mass-based selection of mergers is smaller than that obtained with measurements 
performed on the F160W and F850LP images. On the other, it justifies the choice of M 2 o,masS“^mass as our fiducial 
combination: it can be noticed in fact that these other sets of indices result in either a higher contamination or 
lower completeness with respect to a classification based on M 2 o,masS“^mass (see lower panels of Figure IA2I and 
Figure [7]). An exception is the GiniMASS“^MASS combination which produces comparable results to those obtained 
for M 2 o,masS“^mass- We show in the following that the Gini coefficient is however more sensitive to the choice of 
the aperture used in the calculation and the noise in the image than the M 20 or A indices and hence a less stable 
structural measurement. 

This is illustrated in Figure |X3] where study the variation of the structural indices with image depth and aperture size 
used in the calculation. We only discuss the comparison for the structural indices measured on the mass maps obtained 
from the combinations of HUDF -I-HUDF12, HUDF -|-GANDELS-Deep and GOODS-I-GANDELS-Deep photometry. 
Nonetheless, we also present the results for the indices derived on the i7—band at HUDF or GANDELS depth for 
completeness of information. Specifically, we considered those galaxies (50) with rkron > 5xPSF and H ^24.5 that 
have coverage in the HUDF, GANDELS as well as GOODS fields and calculated the structural indices at all depths, 
within either a Kron or Petrosian aperture. The use of two different apertures allows us to further test the impact 
of the noise in the images: the Kron aperture is usually larger than the Petrosian radiu^ (see Table Ell) and hence 
includes a higher number of low flux pixels; differences between the two estimates can be used to assess the stability of 
the measurement. This is also the reaso n why we opted for a Petrosian aperture in the calculation of the parameters 
in Section [ 53 ] The points in Figure EH show the median values of the structural indices over all the 50 galaxies here 
considered. 

A reliable measure of structure should show as little variation as possible with both imaging depth and aperture 
size in Figure ESI The asymmetry Amass is in this sense very robust with median values that remain almost identical 
at all depths and for both apertures. This should be compared, for example, with the results for the Gini coefficient 
in the right-most column which clearly sho ws a st r ong d ependence on the aperture size with differences up to about 
30%. As already pointed out in the work of iLiskeil (I2008H . this is a consequence of the fact that Gini becomes a tracer 
of the noise in the images rather than galactic structure for large apertures. The other parameters, M 2 o,masSj 'I'mass 
and CmasSj all show some dependence on the depth of the images used for constructing the maps, shifting towards 
lower concentrations/higher dumpiness when using the GOODS-I-GANDELS-Deep photometry instead of the deep 
HUDF -I-HUDF12 imaging. The largest variations are measured for the multiplicity 4'mass (more than 50% change 
among the various cases), whereas the M 2 o,mass coefficient varies by ^ 15%. _ 

In combination with Figure [71 these findings justify our choice of Amass- Tf 2 o,MASS in Section 16.21 as the most 
powerful and less noise- or aperture-dependent parameter set for the discrimination of merging and clumpy galaxies. 


We note that the Petrosian aperture considered here is the 
actual Petrosian radius, i.e., not the default SExtractor aperture 


equal to ^y^Rpetrosian- 
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Figure A2. As in Figure [7| but this time also considering the structural parameters C, Gini and ■0. Top panel block: Relation between 
different structural indices for the sample of MIRAGE simulated mergers and isolated disks. From top to bottom measurements are 
performed on the artificial HST F435W, F850LP, F160W images and on the mass map as indicated in the labels. The dashed lines 
highlight the best dividing relation between mergers and disks (maximum margi n classifier) obtained with a SVM approach. In the Gini- 
M 20 plane, we also show for reference the merger threshold of ILotz et al.l (J2003) with a thin dotted line. Our results remain unchanged 
if using this relation instead of that derived from the SVM algorithm. Lower panel block: For each of the selection criteria in the upper 
panels, we plot the contamination from isolated disks in the merger sample (snapshots falling above dotted line) and the completeness in 
the selected sample of mergers. All mergers independently of their ratio are here considered. Golors and symbols are as in Figure [7| 
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Figure A3. Comparison between the structural indices (A, M 2 O) C and Gini coefficient) measured on images of different depths and 
within different apertures. Points in the figure correspond to the median values of the structural indices over all 50 galaxies that have 
'f'kron > 5xPSF , H ^24.5 and coverage in the HUDF, CANDELS as well as GOODS imaging. The black and red points are the medians 
obtained within either a Petrosian or a Kron elliptical aperture, respectively. The error bars indicate the 16th and 84th percentiles of the 
distributions. Top panels: structural indices derived from the H—band images extracted from the HUDF and CANDELS fields, as shown 
in the y-axis label. Bottom panels: structural indices measured on the mass maps obtained from a combination of HUDF + HUDF12, 
HUDF + CANDELS-Deep and GOODS -h CANDELS-Deep photometry, as shown in the y-axis label. 
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ID BzH H-band Mass RA[J2000] DEC[J2000] z -M2 o,mass 6-Af2o,MASS ^mass ^Mmass Class 


8740 


8750 


9295 


9343 


9407 


9474 


9527 


9835 


9987 



53.16764 -27.83037 1.88 -1.670 0.065 0.114 0.010 Not merging 

53.16287 -27.82947 2.04 -1.220 0.125 0.125 0.040 Unres./Faint 

53.17170 -27.82566 1.73 -1.850 0.070 0.096 0.010 Not merging 

53.16714 -27.82450 1.84 -1.210 0.095 0.160 0.014 Not merging 

53.17061 -27.82379 2.69'* -0.610 0.070 0.460 0.186 Merger (2) 

53.16978 -27.82394 2.24 -1.260 0.055 0.112 0.021 Not merging 

53.15673 -27.82306 1.72 -1.950 0.085 0.036 0.013 Not merging 

53.17343 -27.82028 2.28 -1.320 0.105 0.036 0.078 Unres./Faint 

53.14894 -27.81928 2.23 -1.180 0.085 0.183 0.049 Not merging 


Note. — Fo r all galaxies in the extended sample of Section [2.31 we provide: (1) the CANDELS serial number from the 
iGuo et aJ] (1201,31 ! catalog; (2) the BzH composite image; (3) the H-band image; (4) the mass maps; (5)-(6) RA and DEC in 
J2000; (7) photometric or spectroscopic redshift (galaxies flagged with “s” have a spectroscopic redshift); (8)-(9) Asymmetry 
index Amass measured from the mass map and associated error; (lO)-(ll) M2o,mass index measured from the mass map and 
associated error; (12) classification according to the structural measurements performed on the mass maps: galaxies flagged as 
“Merger (1)” are merger candidates selected with the criterium in Equation [T] whereas galaxies flagged as “Merger (2)” also 
satisfy the depth-dependent selection in Equation[2] Galaxies flagged as Unres./Faint are below our limits of tkro jv > 5 x PSF 
and/or H ^ 24.5 for a reliable mass estimate. All images are 3" wide. Galaxies highlighted with a red exclamation mark are 
possible chance projections based on the analysis in Section (4.21 and those with a symbol have photometry and mass maps 
strongly contaminated by neighbouring objects. The red and white ellipses in the RGB images show the Kron and Petrosian 
radius (used in the calculation of the structural indices from the mass maps), respectively. 

[The complete table can be found at: http://www.phys.susx.ac.uk/~ac625/Tablel_full.pdf 















